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Abstract 

We review the basic properties of the quantum relative entropy for finite- 
dimensional Hilbert spaces. In particular, we focus on several inequalities that 
are related to the second law of thermodynamics, where the positivity and the 
monotonicity of the quantum relative entropy play key roles; these properties 
are directly applicable to derivations of the second law (e.g., the Clausius in- 
equality). Moreover, the positivity is closely related to the quantum fluctuation 
theorem, while the monotonicity leads to a quantum version of the Hatano- 
Sasa inequality for nonequilibrium steady states. Based on the monotonicity, 
we also discuss the data processing inequality for the quantum mutual infor- 
mation, which has a similar mathematical structure to that of the second law. 
Moreover, we derive a generalized second law with quantum feedback control. 
In addition, we review a proof of the monotonicity in line with Petz |105] . 
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1 Introduction 

The quantum relative entropy [T|[2] was introduced by Umegaki [3], analogous to the 
classical relative entropy introduced by KuUback and Leibler In the the early 
days |5l|6], a prominent result on the quantum relative entropy was a proof of the 
monotonicity based on the celebrated Lieb's theorem (TU] and its application 

to the strong subadditivity of the von Neumann entropy [111112]. Ever since, the 
quantum relative entropy has been widely applied to quantum information theory [13}- 
[17] . Moreover, the quantum relative entropy is useful to describe the mathematical 
structure of the second law of thermodynamics [18]. In fact, it has been shown that 
the quantum relative entropy is closely related to recent progresses in nonequilibrium 
statistical mechanics. 

The fluctuation theorem is a remarkable result in modern nonequilibrium sta- 
tistical mechanics, which characterizes a symmetry of fluctuations of the entropy 
production in thermodynamic systems [T9H56] . From the fluctuation theorem, we 
can straightforwardly derive the second law of thermodynamics which states that 
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the entropy production is nonnegative on average. It has been understood that the 
derivation of the second law based on the fluctuation theorem is closely related to the 
positivity of the relative entropy [28l - [3H[33t H8] . In fact, without invoking the fluctu- 
ation theorem, we can directly derive the second law by using the positivity of the 
relative entropy [33l[571|58] . If the microscopic dynamics of a thermodynamic system 
is explicitly described by quantum mechanics, the fluctuation theorem is referred to 
as the quantum fluctuation theorem [53HSS], which is one of the main topics of this 
article. 

On the other hand, the monotonicity implies that the quantum relative entropy 
is non-increasing under any time evolution that occurs with unit probability in quan- 
tum open systems [H [21 [HI [161 [171 EH [60]. It has been known that the second law of 
thermodynamics can also be derived from the monotonicity of the classical or quan- 
tum relative entropy [SIHSS] • Moreover, the monotonicity can be applied to describe 
transitions between nonequilibrium steady states (NESSs). In such situations, the 
monotonicity leads to a second law-like inequality, which we refer to as a quantum 
version of the Hatano-Sasa inequality [62] . In the classical regime, such an inequality 
can also be derived from a generalized fluctuation theorem called the Hatano-Sasa 
equality. It was first discussed for an overdamped Langevin system [M], and has been 
applied to other situations and systems [65n69j . In this article, we discuss the quan- 
tum version of the Hatano-Sasa inequality, while the quantum Hatano-Sasa equality 
has not been fully understood [55] . 

The monotonicity of the quantum relative entropy is also useful in quantum in- 
formation theory [T31|Tni[I7]. In particular, the data processing inequality for the 
quantum mutual information can be directly derived from the monotonicity as is the 
case for the second law of thermodynamics. Moreover, based on the monotonicity, we 
can derive several important inequalities such as the Holevo bound, which identifies 
the upper bound of the accessible classical information that is encoded in quantum 
states [7nH72] . On the other hand, a "dual" inequality of the Holevo bound is related 
to a quantity which we refer to as the QC-mutual information (or the Groenewold- 
Ozawa information) [73l475l[83] . It plays a key role in the formulation of a generalized 
second law of thermodynamics with quantum feedback control [761491] . 

This article aims to be an introduction to the quantum relative entropy for finite- 
dimensional Hilbert spaces, which is organized as follows. 

In Sec. 2, we review the basic properties of quantum mechanics. In Sec. 2.1, 
we introduce quantum states and observables. In Sec. 2.2, we discuss dynamics of 
quantum systems, which is the main part of this section. In particular, we introduce 
the concept of completely-positive (CP) maps, and prove that any CP map has a 
Kraus representation in line with the proof by Choi [22] • 

In Sec. 3, we introduce the von Neumann entropy and the quantum relative en- 
tropy. In particular, we discuss the monotonicity of the quantum relative entropy 
under completely-positive and trace-preserving (CPTP) maps in Sec. 3.3. The strong 
subadditivity of the von Neumann entropy is shown to be a straightforward conse- 
quence of the monotonicity of the quantum relative entropy. 

In Sec. 4, we discuss the quantum mutual information and related quantities. In 
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Sec. 4.1, we introduce the quantum mutual information, and prove the data processing 
inequahty based on the monotonicity of the quantum relative entropy. In Sec. 4.2, 
we discuss the Holevo's x-Quantity, and prove the Holevo bound. In Sec. 4.3, we 
introduce the QC-mutual information and discuss its properties. In particular, we 
prove a "dual" inequality of the Holevo bound. We note that all of the three quantities 
reduce to the classical mutual information for classical cases. 

In Sec. 5, we discuss some derivations of the second law of thermodynamics and its 
variants. In Sec. 5.1, we discuss the relationship between thermodynamic entropy and 
the von Neumann entropy. In Sec. 5.2, we discuss a derivation based on the positivity 
of the quantum relative entropy. In Sec. 5.3, we discuss the quantum fluctuation 
theorem in a general setup, by introducing the stochastic entropy production. The 
quantum fluctuation theorem directly leads to the second law, which is equivalent to 
the derivation based on the positivity of the quantum relative entropy. In Sec. 5.4, 
we discuss a derivation of the second law based on the monotonicity. We also discuss 
relaxation processes toward NESSs, and derive a quantum version of the Hatano-Sasa 
inequality for transitions between NESSs. 

In Sec. 6, we discuss derivations of a generalized second law of thermodynamics 
with quantum feedback control, which involves the QC-mutual information. Our 
derivation is based on the positivity of the quantum relative entropy in Sec. 6.1, 
while on the monotonicity in Sec. 6.2. 

In Sec. 7, as concluding remarks, we discuss the physical meanings and the validi- 
ties of the foregoing derivations of the second law of thermodynamics in detail. 

In Appendix A, we briefly summarize the basic concepts in the linear algebra. In 
Appendix B, we prove the monotonicity of the quantum relative entropy in line with 
Petz [11)5] . 

2 Quantum States and Dynamics 

First of all, we review basic concepts in quantum mechanics. We consider quantum 
systems described by finite-dimensional Hilbert spaces. Let L{H, H') be the set 
of linear operators from Hilbert space H to H' . In particular, we write L{H) := 
L{H, H) (see also Table 1). We note that the linear algebra with the bra-ket notation 
is briefly summarized in Appendix A. 

2.1 Quantum States and Observables 

In quantum mechanics, both quantum states and observables (physical quantities) 
can be described by operators on a Hilbert space. Let H he a Hilbert space that 
characterizes a quantum system. We define Q{H) C L{H) such that any p G Q{H) 
satisfies 

p > and tr[p] = 1, (1) 



4 



Symbols Meanings 

L{H, H') The set of hnear operators from H to H' 
L{H) L{H,H) 
Q{H) {p e L{H) : p > 0, tr[p] = 1} 

Table 1: Symbols and their meanings. 



where p > means that {i^\p\il)) > for any G H (or equivalently, p is positive)]^] 
and tr[p] means the trace of p. We call p G Q{H) a density operator, which describes 
a quantum state. If the rank of p is one so that p = for \ip) G H, p is called 

a pure state and is called a state vectorjl We note that any state vector satisfies 

m) = 1. 

On the other hand, any Hermitian operator X G L{H) is called an observable, 
which describes a physical quantity such as a component of the spin of an atom. Any 
observable X is assumed to be measurable without any error in principle. Let X := 
'^k-'^k\fk){fk\ be the spectrum decomposition, where {Iv^fc)} is an orthonormal basis 
of H. By the error-free measurement of observable X, the measurement outcome 
is given by one of x^'s. For simplicity, we assume that Xk 7^ Xk' for k ^ k'; this 
assumption will be removed in Sec. 2.2.3. We note that k is also referred to as an 
outcome. The probability of obtaining x^ is given by 

p{k) := ((^fc|p|(^fc), (2) 

where p G Q{H) is the density operator of the measured quantum state. Equality ([2]) 
is called the Born rule. The sum of the probabilities satisfies 

= $^(<^fc|p|<^fe) = tr[p] = 1. (3) 

k k 

We note that p > and tr[p] = 1 respectively confirm p{k) > and ^kP{k) = 1. 
The average of the outcomes is then given by 

{X):=J2pik)xk = tT[Xp], (4) 

k 

which is a useful formula. If p = is a pure state, the Born rule ([2]) reduces to 

Pik) = \{^km', (5) 

and Eq. (HD to 

(X) = (^|X|^). (6) 

^In the present article, we follow the terminologies in Ref. [H] and say that a hermitian operator 
X is positive if X > and positive definite if X > 0. See appendix A for details. 

^In terms of the algebraic quantum theory using C*-algebras, the definition of pure states depends 
on the choice of the algebra that is generated by observables. The above definition is valid only if 
the set of observables equals the set of all Hermitian operators in L{H). In other words, we assumed 
that there is no superselection rule. 
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If there are two quantum systems A and B described by Hilbert spaces Ha and 
Hb, their composite system AB is described by the tensor product space Hb- 
Let p^^ e Q{Ha ® Hb) be a density operator of the composite system. The partial 
states corresponding to A and B are respectively given by 

p^ = tr^[p^^], p^ = tr^[p-^^], (7) 

where tr^ and tr^ describe the partial traces in H^ and Hb, respectively. For any 
observable G L{Ha) and the identity G L{Hb), we have 

tr^B[(X^ ® /^)p^''] = tr^[XV], (8) 

which is consistent with Eq. ([7]). If p^^ = p^®p^ is satisfied, p^"^ is called a product 
state. If a pure state is not a product state, it is called an entangled state. 

We next show that any quantum state can be written as a pure state of an 
extended system including an auxiliary system. Let K he a set of indexes and 
p = J2k€K Pkli^k) {i^kl £ Q{H) be a state, where \ipk)'s are not necessarily mutually- 
orthogonal. We introduce an auxiliary system R described by a Hilbert space Hpi 
with an orthonormal basis {\rk)}keK- By defining a pure state 

1^) := 5^ VP~k\^k) \rk) eH® Hr, (9) 

k&K 

we have 

p = trnm{n- (10) 
The state vector |\E') in Eq. is called a purification of p. We note that a purification 
is not unique. 



2.2 Quantum Dynamics 
2.2.1 Unitary Evolution 

The time evolution of an isolated quantum system is given by a unitary evolution. A 
density operator p G Q{H) evolves as 

p ^ UpU\ (11) 

where U G L{H) is a unitary operator satisfying U^U = UW = I. Any unitary 
evolution preserves the trace (i.e., trlUpW^] = tr[p]) and the positivity of p. In the 
continuous-time picture, the time evolution of a density operator is given by the von 
Neumann equation: 

^ = -i[H, Pit)] := -i{Hp{t) - p{t)H), (12) 

where H G L{H) is a Hermitian operator called the Hamiltonian of the system. We 
seih=l in this article. The unitary evolution from time to t is described by unitary 
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operator U{t) = e~^^^. If we control the system by changing some external classical 
parameters such as a magnetic field, the Hamiltonian of the system can depend on 
time. In such the von Neumann equation is given by 

^ = -imt),p{t)], (13) 

which leads to 

Uit) = / / ■ ■ ■ / dt^H{t,)H{t2) ■ ■ ■ H{t^) 

n=0 Jo Jo ^^^^ 

=: Texp l^-i^ H{t')dt'^ 
where "T" represents the time-ordered product. 
2.2.2 Completely Positive Maps 

For an open quantum system that interacts with another quantum system, the time 
evolution is not given by a unitary evolution in general. Moreover, the input state 
(the initial state) and the output state (the final state) can be described by different 
Hilbert spaces with each other. The time evolution is generally described by a liner 
map S : L{H) — )■ L{H'), where H and H' are the Hilbert spaces that describe the 
input and output systems, respectively. 

For example, we consider a time evolution in which the input state is in Q{H) and 
the output state is in Q{H'). If another quantum system described by H" comes 
to interact with the input system and we access the total output state, then the 
output system becomes larger than the input one; the output system is described by 
H' = H® H". 

The general condition for £ is given by the complete positivity. If £ occurs with 
unit probability, it needs to be trace-preserving. We will discuss these two properties 
in detail. 

Definition 2.1 £ is called positive if £{X) > for any X E L(H) such that X > 0. 
Moreover, £ is called completely positive (CP) if £ ® X„ is positive for any n G N, 
where X„ is the identity operator on L(C"). 

The positivity is enough to confirm the positivity of the output state in Q{H'). 
On the other hand, the complete positivity confirms the positivity of the density 
operator of the total system including the environment. In fact, if a time evolution 
was not CP, one might observe a negative probability in the total system. We note 
that an important example of £ that is positive but not CP is the transposition 
of operators with a matrix representation, which has been used for characterizing 
entanglements [93|[M]. 

The definition of the complete positivity can be rewritten as follows: £ is CP if 
L(i3"')-valued matrix {£{Xki))i<k i<n is positive for any L(i/)-valued positive matrix 
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Xn := iXki)i<k,i<n with Xki G L{H). The equivalence of the two definitions is 
confirmed as follows: X„ can be written as 

= J]X«(H)|efe)(e,| eL(Jf (8)C"), (15) 

kl 

where {|efe)}^^j is an orthonormal basis of C". We then have 

0X„)(X„) = Y,^{Xki) \ek){ei\ e L{H' ® C"). (16) 

kl 

We note that £ is called n-positive ii £ is positive for a n. 

Any time evolution in quantum systems needs to be CP. On the other hand, we 
introduce the second important property of £: 

Definition 2.2 We call £ : L{H) L{H') trace-preserving (TP) if tr[£{X)] = 
tr[X] for any X G L{H). 

This property confirms the conservation of the probability as tr[£^(p)] = 1 for 
p e Q{H). If £ is both CP and TP, it is called CPTP (completely positive and 
trace- preserving) . Any time evolution that occurs with unit probability needs to be 
CPTP. 

The foUowings are simple examples. 
Example 2.1 Any unitary evolution £{p) := UpW is CPTP. 

Example 2.2 Let p G L{Ha <S) Hb). The partial trace £{p) := tr b[p] is CPTP. In 
fact, £ is CP, because, for any positive operator a G L{Ha®Hb®'C^)-, {£ ®Xn){(T) ~ 
trB[a] is also positive. £ is TP, because tr^[trB[p]] = tr^s[p]- 

Example 2.3 Let pa G Q{Ha). A map £ : Q{H) Q{H (g) Ha) defined by 
£{p) ■= p^ Pais CPTP. 

We have the following example by combining the above three examples. 

Example 2.4 Let pa G Q{H a) be a state and U G Q{H ® Ha) be a unitary 
operator. A map £ : Q{H) Q{H) defined by £{p) := tr aWp (8) PaU^ is CPTP. 

The above example is a typical description of the dynamics in open quantum 
systems. It will be shown in Sec. 2.2.4 that any CPTP map from L{H) to L{H) can 
be written in this form. The next example is a generalization of the above example 
to the cases that the input system and the output system are different. 

Example 2.5 Let H and H' be Hilbert spaces corresponding to the input and the 

output systems, respectively. We introduce auxiliary systems described by Ha and 
Hb and assume that H ® Ha — H' ® Hb- Let pA G Q{Ha) be a state and 
U G L[H ® Ha) — L{H' Hb) be a unitary operator. Then a linear map £ : 
L{H) L{H) defined by £{p) := tTBpp ® paU^ is CPTP. 
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We next consider quantum measurement processes [T HI^SHTUn] . Suppose that we 
perform a measurement on a quantum system and obtain outcome k with probabihty 
p{k). We assume that the number of possible outcomes is finite. Let p G Q{H) be the 
pre- measurement state and pk G Q{H') be the post-measurement state with outcome 
k. We define a hnear map £k : Q{H) — )• Q{H') such that 

where the probabihty of obtaining outcome k is given by 

p{k)=ii[£k{p)]. (18) 

The map £k needs to be CP, but it is not TP if p{k) ^ 1. In general, a time evolution 
that does not occur with unit probability does not need to be TP. We note that £k 
needs to satisfy 

ii[£k{X)] < ti[X] (19) 

for any positive X G L{H), since p{k) < 1 must hold for any p G Q{H). The 
ensemble average of p^'s over all outcomes is given by J^kPWPk = J2k^k{p) ='■ £{p), 
where £ := Ylik^k is a CPTP map. We note that {£k} is called an instrument, which 
characterizes the measurement processl^l 

We note that £ : L{H) — )■ L{H') is called a unital map if it satisfies £{I) = I', 
where / and /' are the identities on H and H', respectively. 



2.2.3 Kraus Representation 

We next show that any CP map has a useful representation, which is called the Kraus 
representation. 

Theorem 2.1 (Kraus representation) A linear map £ : L(H) — )• L(H') is CP if 
and only if it can be written as 

£ip) = J2MkpMl (20) 

k 

where p G L{H), Mk G L{H, H'), and the sum in the right-hand side (rhs) is taken 
over a finite number of fc's. Equality f l20|) is called a Kraus representation, and M^'s 
are called Kraus operators. 

Proof. (Choi [92]) (Proof of ^) Suppose that £ is CP. Let {\ei)}i be an or- 
thonormal basis of H. An operator E := \ ® |ei)(ej| is positive because 

{ilj\E\ip) = I Ei(ei|(ei|^)P > for any G H®H. Since £ is CP, £®X is positive 
with X the identity on L{H). Therefore, the following operator is positive: 

{£®X){E) = ^^(|e,)(e,-|) ® |e,)(e,-| G L{H' ® H). (21) 



■^Rigorously speaking, an instrument is a map from K' to X^fceK' ^k, where K' is an element of 
a cr-algebra over K = {k}. 
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Therefore, it has a spectrum decomposition of the form {£ ®X){E) = \vk){vk\ 
with \vk) G H' ® H. We note that \vk) can be written as 1^;^) = kfc)|ej) with 
\xl) e H'. We then obtain 

i£®I){E) = Y,K){<\^\e^){e,\. (22) 

kij 

In addition, = Mk\ei){ej\Ml holds with := J^tl^Di^tl e L{H,H'). 

Therefore, we obtain 

S{\e,){e,\) = J2Mkh){eM (23) 
for any (i,j), which imphes Eq. fl2U]) . 

{Proof of ^) Suppose Eq. (I2D]). Then (f ®X„)(a) = ^^(Mfc®/„)(T(M^^(g)/„) holds for 
any cr G L(i? (8)C"), where X„ and J„ are the identities on L(C") and C", respectively. 
We then obtain, for any G H' ® C", 

(V^|(£®X„)(a)|^) = 5^(V^fc|a|V^,), (24) 

where |'?/'fc) := ® /n|^)- Therefore, {'4'\{£ I„){a)\-ip) is positive if a is positive, 
which implies that £^ (g) X„ is positive and therefore S is CP. □ 

Theorem 2.1 was first proved by Kraus [SB] based on Stinespring's theorem |97j . 
The above proof is based on Choi's proof [92] . 

We note that the Kraus representation is not unique. We also note that the above 
proof implies that the A^-positivity is enough for S to have a Kraus representation 
with the dimension of H. The following theorem connects the condition of TP to 
the Kraus representation. 

Theorem 2.2 Let S : L{H) L{H') be a CP map. S is TP if and only if Kraus 
operators satisfy 

J2mIm, = I, (25) 

k 

where / is the identity on H. 



Proof. Suppose that £ is CPTP. We then have 



tr[p] = tr[£ip)] = tr 



(26) 



for any p G L{H), which implies Eq. fl25|) . Conversely, if Eq. fl25|) is satisfied, tr[p] 
tr[^(p)] holds. □ 
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We consider quantum measurements in terms of the Kraus representation. We 
first note that CP map £k is written as 



Since £ := Ylik^k is assumed to be CPTP, we have 



ki 



The probabihty fllSp of outcome k is then written as 



p{k) = tr 



tr[-Efcp], 



where we defined 



It is obvious that E^s satisfy 



Sk{p) = J2MkipMl 

i 



i 

Ek>0 



(27) 
(28) 

(29) 

(30) 
(31) 



and 

J2Ek = I. (32) 

k 

Inequahty fl^ confirms the positivity of the probabihty (i.e., p{k) > 0), and Eq. fl5^ 
confirms that '^^pik) = 1. Any {Ek} C LiH) satisfying fl?T]) and ([32D is called the 
positive operator- valued measure (POVM)o 

The projection measurement of an observable is a special case of the foregoing 
general formulation of quantum measurements. Let X = XkPk be an observable, 
where P^s are projection operators. We assume that Xk 7^ Xk> for k 7^ k'. If the 
Kraus representation of Sk is given by 

Skip) = PkpPk, (33) 

then the measurement is called the projection measurement of X. In this case, the 
POVM consists of projection operators {Pk}, and the probability of outcome k is 
given by p{k) = tr [P^p], which is a slight generalization of the Born rule ([2]). We 
also refer to the projection measurement of X as that of {Pk}- In particular, if Pk is 
written as Pk = \ipk){'^k\ for any k, the projection measurement of X is referred to 
as that of ortho normal basis {iV'fc)}- 

''Rigorously speaking, a POVM is a map from K' to J^keK' ^k, where K' is an element of a 
(T-algebra over K — {k}. 
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Example 2.6 We consider a simple model of a photodetection. Suppose that H is 
2-dimensional and describes a two-level atom. Let {|0), |1)} C be an ortho normal 
basis, where |0) and |1) respectively describe the ground state and the excited state. 
The atom emits a photon with probability p if it is in the excited state. We observe 
the photon number with unit efficiency, where the outcome is given by "0" or "1." In 
this case, the Kraus operators are given by 

Mo := |0)(0| + v/r^|l)(l|. Ml := v^|0)(l|, (34) 

which leads to the POVM that consists of 



E, := |0)(0| + (1 -p)|l)(l|, E, := (35) 

If p = 1 holds, this measurement becomes the projection measurement of X := 
a;o|0)(0| + a;i|l)(l|, where we can define Xq := and Xi := 1. 



2.2.4 Indirect Measurement Model 

We next show that any quantum measurement described by {Sk} with £k : L{H) — )■ 
L{H) can be written by a simple model of an indirect measurement. 

Theorem 2.3 Let {Sk} be an instrument with £k : L{H) L{H). There exist an 
auxiliary system R described by Hr, unitary operator U G L{H ® Hr), a reference 
state \(f)ji) G Hfi, and projection operators {Pk} C Hj^ satisfying J2k^k ~ ^^'^^ 
that 

Skip) = ti rHI ^ Pk)Up ^ \(l)n){(l)R\U\l(S) Pk)] (36) 
for any p and k, where I is the identity on H. 

Proof. Let {Mki}i C L{H) be the set of Kraus operators of Sk, which is given 
by Eq. (|27|) . We introduce an auxiliary system such that Hr has an orthonormal 
basis {\ki)}ki- We first show that, for any G Hr, there is a unitary operator 
U e L{H ® Hr) satisfying 

UmcPR) = Y,Mkrmk^)- (37) 

ki 

In fact, U conserves the inner product on \ip) G H} <Z H ® Hr, that is, 

(j2{^\{k7\Mi,] (Y.Mkmk^)] = J2{mLMk:\^p) = m 

\k'i' / \ ki j ki 

holds for any I?/'), G H . We then have 

f/p ® 1 I = 5^ MfcipM^t, ., ®\ki) {k'z' I . (39) 

kik'i' 
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By defining := \ki){ki\ G L{Hji), we have 

(/ ® Pk)Up ® ® P,) = 5^Mfc,pM,V ® \kt){kt'\, (40) 

and therefore 

tr^[(/ ® Pfc)[/p ® |0^)(0«|f/t(/ ® P,)] = 5^ Mfc,pM,^, = Skip), (41) 

i 

which implies fl36l) . □ 

Physically, P can be regarded as a probe system such as the local oscillator of 
a homodyne detection, and as the initial state of the probe such as a coherent 
state. The measured system described by H interacts with the probe system by the 
unitary evolution. We next perform the projection measurement with {Pk} on the 
probe, and obtain the information about p. The effect of this indirect measurement 
is characterized only by instrument {£k}- In the case of a CPTP map. Theorem 2.3 
reduces to the following corollary. 

Corollary 2.1 Let £ : L{H) — L{H) be a CPTP map. There exist an auxiliary 
system P described by H^, unitary operator U G L{H ® Hr), and a reference state 
\(t)R) G Hji such that 

E{p)=iiR[Up®\ct)n){MU^]- (42) 

The above corollary implies that any nonunitary evolution 8 : L{H) — )■ L{H) can 
be modeled by a unitary evolution of an extended system. 

We note that, in Theorem 2.3 and Corollary 2.1, the initial state of the total 
system described hy H ® Hr is a product state. In fact, if a CP map is reproduced 
by a single indirect measurement model for an arbitrary input state p G Q{H), then 
the initial state of P should be independent of p. 

2.2.5 Heisenberg Picture 

We briefly discuss the Schrodinger and the Heisenberg pictures of time evolutions. 
We define an inner product of X, y G L{H) by 

(X,F)Hs:=tr[XtF], (43) 

which is called the Hilbert-Schmidt inner product. We define the adjoint of a linear 
map £ : L{H) — )■ L{H') in terms of the Hilbert-Schmidt inner product. A linear 
map S'^ : L[H') — t- L{H) is the adjoint of £ if it satisfies 

{£\X),Y)ns = {X,£{Y))ns (44) 

for any X G L{H') and Y G L{H). If £ is CP and its Kraus representation is given 
by Eq. fl2Ul) . the adjoint of £ is written as 

£\X) = Y,MlXMu, (45) 

k 
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which imphes that is also CP from Theorem 2.1. We note that a CP map E is TP 
if and only if E'^ is unital, because E\l) = J2k ^l^k- 

Let p e Q{H) be a state, X e L{H') be an observable, and £ : L{H) L{H') 
be a CP map. We then have 

ti[X£{p)]=iip{X)pl (46) 

where the left-hand side (Ihs) is called the Schrodinger picture, while the rhs is called 
the Heisenberg picture. 

3 Quantum Relative Entropy 

We now introduce the quantum entropies and discuss their basic properties. 

3.1 Von Neumann Entropy 

We first introduce the von Neumann entropy |95j . 

Definition 3.1 The von Neumann entropy of p G Q{H) is defined as 

S{p):=-ii[p\np\. (47) 

Remark 3.1 We note the relationship between the von Neumann entropy and the 
classical Shannon entropy |101lll02j . Let {p(a)}agA be a probability distribution on 
a set A. We regard the distribution as a vector whose a-th entry is p(a), which we 
denote as p := {p{a))aeA- The Shannon entropy (or the Shannon information) of p 
is defined as 

iJ(p):=-5^p(a)lnp(a). (48) 

If the spectrum decomposition of p is given by p = Yli'P{.'^)\Va){Va\ with an orthonor- 
mal basis {\ipa)}a&A, the von Neumann entropy of p reduces to the Shannon entropy 
of p: 

S{p) = Hip). (49) 
The following theorems describe basic properties of the von Neumann entropy. 

Theorem 3.1 Let p^'s are density operators whose supports are mutually orthogo- 
nal. Then the von Neumann entropy of p := ^j^Pkpk with ^Pfc = 1 satisfies 

S{p) = H{p) + Y.p,S{pk), (50) 
it 

where H{p) := -J^^Pk^^^Pk- 
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Proof. Since the supports of p^'s are mutually orthogonal, we have 

Sip) = -^tr[pfcPfcln(pfcPfe)] 

k 

= - J^tr[pfcpfc(lnpfc + Inpfc)] (^^^^ 

k 

= - J^Pfclnpfc - ^Pfctr[pfclnpfc], 
k k 

which implies Eq. (13U]) . □ 

Theorem 3.2 Let |\1') G Ha ® Hb be a state vector of a composite system, whose 
partial states are given by p^ := trB[|\&)(\l/|] and p^ := trA[|\I^) (^|]. Then 

5(P^) = Sip^). (52) 

Proof. Let |^) = J2k \/Pk\Vk)\4'k) be the Schmidt decomposition of |\E') with J2kPk = 
1. Then p^ = J2kPk\'^k){<^k\ and = Y.kPk\^k) {^k\ hold, and therefore S'(p^) = 

3.2 Quantum Relative Entropy and Its Positivity 

We now introduce the quantum relative entropy and prove its positivity. 

Definition 3.2 Let p,cr E Q{H). The Quantum relative entropy of p to a is defined 
as 

S{p\\a) :=tr[plnp] -tr[plna]. (53) 

If there exists \ip) G H that satisfies alip) = and {ip\p\ip) 7^ 0, the quantum relative 
entropy is defined as S{p\\a) := +00. 

Remark 3.2 We note the relationship between the quantum relative entropy and the 
classical relative entropy (the KuUback-Leibler divergence) [i|ll02] . Let p := {p{a))aGA 
and q := {q{a))aeA be probability distributions on a set A. The classical relative 
entropy of p to g is defined as 

S{p\\q):=^p{a)\n^^. (54) 

If two density operators are given by p = Y.aPi^)\'^a){fa\ and cr = g (a) (v^al 
with an orthonormal basis {\ipa)}a£A, the quantum relative entropy reduces to the 
classical one: 

Sip\\a) = Sip\\q). (55) 

We next prove the positivity of the quantum relative entropy, which plays a key 
role to derive the second law of thermodynamics in Sees. 5.2 and 5.3. 
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Theorem 3.3 (Positivity of the quantum relative entropy) 

Sip\\a) > 0, (56) 

where the equahty is achieved if and only if p = ex. Inequality f l56p is called the Klein 
inequality. 

Proof. Let p = Y.aPi"')\^a){^a\ aud a = Y.a9i^)\^a) {Va\, where {\^a)} and {\^Pa)} 
are orthonormal bases. We first show that 

S{p\W) > S{p\\a'), (57) 

where a' := J2a^(^)\'^a) {i^al- Inequality flFTI) is equivalent to — tr[plncr] > — tr[plncr']. 

We note that -tr[plno-] = - Y.aPi^){^ci\ lncr|?/'a) and -tr[plno-'] = - Y^aPi^) In(^akl^a)- 

By applying the Jensen inequality to convex function — Inx, we have 



■{i^a\lna\iJa) = - Y,m\^b)\'\nq{b) 



>-ln\^m^,)\\ib) 
= -\n{i)a\a\iJa), 



(58) 



where we used that J2b \ {'^a\Vb)\'^ = 1 holds for any a. Therefore, we obtain inequality 
f lFr|) . The equality in flFTI) is achieved if and only if |(V'a|v'/(6))P = ^ab, where /(■) is 
a bijection map and 6ab is the Kronecker delta. By relabeling the indexes of {1^9^)}, 
we can choose f{b) = b without loss of generality. 
We next show 

Siph') > 0, (59) 

which is equivalent to the positivity of the classical relative entropy "^^Pi^) 
By using inequality ln(x^^) > 1 — x for x > 0, we obtain 

^,(„),„|)>^,w(i_iW).o, m 

which implies inequality The equality in (IS^ is achieved if and only if p{a) = 

q{a) for any a. 

By combining inequalities fl571) and fl59|) . we obtain inequality fl56l) . The equality 
is achieved if and only if \{ipa\'^b)\'^ = Sab and p{a) = q{a), which implies p = a. □ 

By removing the assumptions that tr[p] = J2aPi^) ~ ^ ~ J2ali^) ~ ^ 

from the above proof, we can straightforwardly obtain a generalization of Eq. (1561) : 

tr [p(ln p - In a)] > tr [p - a] (61) 

for any positive operators p and a. Inequality (1611) is also called the Klein inequality. 

The subadditivity of the von Neumann entropy is a direct consequence of the 
positivity of the quantum relative entropy. 
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Theorem 3.4 (Subadditivity of von Neumann entropy) Let p E Q{Ha®Hb) 
be a density operator of a composite system, whose partial states are given by : = 
tr^ip^^] e Q{Ha) and p^ := tr^ip^^] G Q{Hb). Their von Neumann entropies 
satisfy 

S{p^^) < S{p^) + S{p^\ (62) 
where the equahty is achieved if and only if p^^ = p^ ® p^ . 

Proof. We have 

^(p^) + S(p^) - 5(p^^) = S(p^^||p^ ® p^) > 0, (63) 
where the right equality is achieved if and only if p^'^ = p^ ® p^ . □ 

3.3 Monotonicity of the Quantum Relative Entropy 

We next discuss that the quantum relative entropy is non-increasing under any CPTP 
map, which is called the monotonicity. The monotonicity can be applied to a deriva- 
tion of the second law of thermodynamics and to a lot of theorems in quantum 
information theory. 

Theorem 3.5 (Monotonicity of the quantum relative entropy) Let p,a E Q{H) 
be states and £ : L{H) — )■ L{H') be a CPTP map. Then 

S{8{p)\\£{a))<S{p\\a), (64) 

which is called the Uhlmann inequality. 

Several proofs of the monotonicity have been known, which are not so simple. 
One of the proofs will be shown in Appendix B. The following corollary is a special 
case of the monotonicity. 

Corollary 3.1 Let p^^,(7^^ E Q{Ha ® Hb) be density operators of a composite 
system, whose partial states are given by p^ := tiBlp"^^] and := trB[o"^'^]. Then 

S{p^h^) < 5(p^^||(T^^). (65) 

Proof. Apply the monotonicity ([64D to a CPTP map 8 : L{Ha ® Hb) L{Ha) 
such that £{p) := trij[p] for p E L{Ha <S) Hb). □ 

We also have the following corollary, which implies that the von Neumann entropy 
is non- decreasing for a special class of CPTP maps. 

Corollary 3.2 Let 8 : L{H) L{H) be a unital CPTP map satisfying 8{I) = /, 
where / is the identity on H. The von Neumann entropy is then non-decreasing: 

S{p) < S{8{p)). (66) 
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Proof. Let d be the dimension of H. We than have 



S{p) 



S{p\\I/d)+\nd< -S{£{p)\\£{I/d)) + \nd 
S{S{p)\\I/d) + lnd = S{S{p)), 



(67) 



where we used the monotonicity of the quantum relative entropy. □ 

Let S{p) = J2k ^kpMl be a Kraus representation of £. The condition of £{I) = I 
is satisfied if all of the Kraus operators are Hermitian such that = M^. In 
particular, £{I) = I holds if M^'s are projection operators 

The strong subadditivity of the von Neumann entropy is easily obtained from the 
monotonicity of the quantum relative entropy. 

Theorem 3.6 (Strong subadditivity of the von Neumann entropy) Let p^^*^ g 



Q{Ha®Hb®Hc), := trc[p^^^], p^^ := tr^fp^^^], andp^ := tiAc[p^''% Then 



Proof. Let := lA/d^, where I a is the identity on Ha and c?^ is the dimension 
of Ha- We then have 



where we used the monotonicity of the relative entropy for a CPTP map £ : L{Ha ® 
Hb (8) He) -> L{Ha ® Hb) such that £{p) = trc[p] for p G L{Ha (S) Hb (S) He). □ 

Historically, the strong subadditivity (1681) of the von Neumann entropy was first 
proved based on the Lieb theorem [T0Hl2j. Later, the monotonicity inequalities flMl) 
and fl65|l for the quantum relative entropy were proved from the strong subaddi- 
tivity (l68l) [3-[9]. On the other hand, Petz |103j proved monotonicity fl65l) without 
invoking the strong subadditivity. Nielsen and Petz |lU4j pedagogically discussed this 
proof. In a similar manner, Petz |1U5] showed a direct proof of monotonicity (j64p . 
which we will discuss in Appendix B. 

Remark 3.3 In contrast to the quantum case, it is easy to prove the monotonicity 
of the classical relative entropy [102] . Let p := {p{a))aeA and q := {q{a))a£A be 
probability distributions on A. The classical counterpart of a quantum CPTP map 
is a Markov maps, which is given by transition probabilities {r{b\a)}a(zA,beB with 
^^r(6|a) = 1 such that 



their von Neumann entropies satisfy 




(68) 



[Sip^^) + S{p^^)] - [S{p^^^') + S{p^)] 
= [S{p^^) - S{p^^^)] - [S{p^) - S{p^^)] 
= 5(p^^^||(T^ ® p^^) - 5(p^^||a^ ® p^) 
>0, 



(69) 



P'ib) := ^r{b\a)p{a), q\h) := ^r{h\a)q{a). 



(70) 



a 



a 
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We write S{p) := {p'{b))beB and £{q) := {(l'{b))b^B- Our goal is to show 



S{p\\q)>S{S{p)\\S{q)) 



(71) 



for the classical relative entropy. Let p{a,b) := r{b\a)p{a) and q{a,b) := r{b\a)q{a). 
We then have 



where p (a 1 6) := p{a,b)/p'{b) and q{a\b) := q{a,b)/q'{b). By noting that 
^^^p(a, fe) ln(p(a|6)/g(a|6)) > holds from the positivity of the classical relative 
entropy, we obtain inequality fITT]) . We note that the strong subadditivity of the 
Shannon entropy straightforwardly follows from the monotonicity of the classical rel- 
ative entropy. 

4 Quantum Mutual Information and Related Quan- 
tities 

In this section, we discuss the basic properties of the quantum mutual information 
and related quantities. In particular, we introduce two important quantities that 
are closely related to the quantum mutual information: the Holevo x-Quantity and 
the QC-mutual information (the Groenewold-Ozawa information). We discuss their 
information-theoretic meanings. 

4.1 Quantum Mutual Information 

We first introduce the quantum mutual information. 

Definition 4.1 Let p^-^ G Q{Ha®Hb) be a quantum state and := trB[p^^] and 
:= ti aIp"^^] be its partial states. The mutual information between two systems is 
then defined as 




(72) 



S{£{p)\\£{q)) + Y,VM In 



P{a\b) 
q{a\b) 



jA:B^pAB^ := 5(p^^||p^ ® p^) = S{p^) + 5(p^) - 5(p^^). 



(73) 



From the positivity of the quantum relative entropy. 



jA:B^pAB^ > 



(74) 



holds, where the equality is achieved if and only if p^^ = p^ ® p^ . 
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Remark 4.1 We note the relationship between the quantum mutual information and 
the classical mutual information. Let {\fa)}aeA and {\i'b)}beB be orthonormal bases 
of Ha and Hb, respectively. We define 

•= 5^PK^)l<^a)(<^a| ® \i^b){i^b\, (75) 
ab 

where {p{a,b)}(^a,b)£AxB is a classical probability distribution on A x B. Then the 
quantum mutual information reduces to 

jA:B(^pAB^^J2pia,b)ln^^ (76) 
^ p{a}p{b) 

where p{a) := J2bPi^y^) Pi^) ■~ J2aPi^^^)- "^^^ -^1- (ESD is the classical 

mutual information between A and B. 

We now discuss the data processing inequality, which is a straightforward conse- 
quence of monotonicity f lM|) of the quantum relative entropy. 

Theorem 4.1 (Data processing inequality) Let £a '■ L{Ha) — ?■ L{Ha') and 
Sb '■ L{Hb) — i- L{Hb>) be CPTP maps. The quantum mutual information is non- 
increasing hj £a ® Sb- 

I^'-'^^'USa ® £b){p^'')) < /^^^(p^^). (77) 

Proof. Noting that {£a ® £b){p^ ® P^) = ^a(p^) ® ^b(p^) and 

tiB'liSA ® ^b)(P^'')] = Sa{p^), tTA'liSA ® ^i.)(p^^)] = ^ij(p^), (78) 

inequality (1771) follows from the monotonicity (16^ of the quantum relative entropy. 
□ 

The data processing inequality states that the quantum mutual information never 
increases by any CPTP map that is performed on each systems individually. The 
following corollary is a special case. 

Corollary 4.1 We consider three systems corresponding to Ha, Hb, and Hq. Then 

jA.b^^ab^ < /^^^^(p^^^). (79) 

Proof. By taking £bc{p^'^) '■= trc[p'^'"] and applying Theorem 4.1 to Xa ®£bc, we 
obtain inequality (|7^ . □ 

Remark 4.2 We note that inequality fl79|) can be written as 

5(p^) + S{p^) - 5(p^^) < S{p^) + 5(p^^) - 5(p^^^), (80) 
which is equivalent to the strong subadditivity (l68i) . 
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4.2 Holevo's ^-quantity 

We next introduce the Holevo's x-Quantity (or just the x-quantity) that is related to 
the accessible classical information encoded in quantum states [70I472] . 

Definition 4.2 Let A be a finite set, {p{a))a£A be a probability distribution with 
J2aPi^) ~ Pa. ^ Q{Hs) be a quantum state labeled by a G A. The x-Quantity 

is defined as 

X^'--=Sip)-J2pi^)SiPa), (81) 

a 

where p := Y.aPi'^)Pa- 

We introduce an auxiliary system Ha with orthonormal basis that can 

store the classical information about a. We define a density operator 

p"^^ ■.= ^p{a)\^a){Va\® Pa- (82) 
a 

The x-quantity is then given by the mutual information 

X^' = I^-%P^'\ (83) 

which is a useful formula. 

Remark 4.3 We note the relationship between x-quantity and the classical mutual 
information. Let {\'4^h)}h&B be an orthonormal basis of Hs- We assume that p^'s can 
simultaneously be diagonalized as 

Pa = 5^p(&|a)|V'fe)(V'6|, (84) 
b 

where 'YlibPi^\^) ~ ^ ^'^y ^- ^^^^ case, we can straightforwardly show that 

X-^^ = I^--^ (85) 

where I^'^ is the classical mutual information between A and B for the joint distri- 
bution p(a, h) := p{b\a)p{a). 

The following theorems describe important properties of the x-quantity and the 
von Neumann entropy. 

Theorem 4.2 (Concavity of the von Neumann entropy) The x-Quantity sat- 
isfies 

X^^ > 0, (86) 

or equivalently 

S{p) > J2pia)S{pa), (87) 

a 

which is called the concavity of the von Neumann entropy. The equality is achieved 
if and only if p{a) = 1 for a single a. 
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Proof. The inequality is obvious from the positivity of the mutual information in 
Eq. (1831) . The equality is achieved if and only if p^^ is a product state, which implies 
that p{a) = 1 holds for a single a. □ 

Theorem 4.3 The x-Quantity satisfies 

X^' < Hip), (88) 

or equivalently 

S{p)<H{p) + J2pici)S{pa), (89) 

a 

where H{p) := — ^^^^(a) Inp(a). The equality is achieved if the supports of p^'s are 
mutually orthogonal. 

Proof. We introduce an auxiliary system H^' with orthonormal basis {\'ipa)}a£A- 
We define a state 

a^^' := Y,Pi<^)\¥>a){^a\ ® mi^Pal G Q{Ha®Ha'), (90) 

a 

where the mutual information between A and A' is given by I"^'^' [a^^') = H(p). On 
the other hand, we define a CPTP map S : L{Ha') L{Hs) such that 

£{\^a){i^a\)=Pa (91) 

for any a. In fact, we can construct £ satisfying Eq. fPTl) as follows. Let pa = 
^iQaii)\ci'i){0'i\ be the spectrum decomposition of pa- We define Kraus operators 

Mai := VqjJ)\ai){4Ja\ G L{Ha',Hs), (92) 

which satisfies 

J^MlMa^ = X]9a(*)IV^a)(«^l«^)(^a| = l^a) {lpa\ = I^' , (93) 

ai ai a 

where I^' is the identity on Ha'- By defining £ with the Kraus operators (1921) . we 
have 

^(|V^a)(^a|) = 5^ga(^)|a«)(a2| =Pa, (94) 
i 

which confirms Eq. (19T]) . 

By applying £ ® to cr"^^ with Ia the identity on L{Ha), we have 

(£ ® X^)(a^^') = 5^p(a)p„ ® =: p^^. (95) 

a 

Therefore, from the data processing inequality fl77|) . we obtain 

H{p) = I^-^'{a^^')>I^--'{p^') = X^', (96) 

which implies inequality (189 p . If the supports of p^'s are mutually orthonormal, the 
equality in (15^ is achieved because of Eq. (I5U]) . □ 
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Theorem 4.3 implies that Eq. (IHJ]) is replaced by inequality ( 15^ if the supports 
of Pa's are not mutually orthogonal. The following corollary is a direct consequence 
of Theorem 4.3. 

Corollary 4.2 We define p := 'YlaP{^)\^a) {<t>a\ with ^aP(a) = 1, where |(/)a)'s are 
not necessarily mutually-orthogonal. We then have 

S{p) < H{p), (97) 

where H{p) := — XlaPl*^) ^^pI*^)- 

Proof. Apply Theorem 4.3 to pa ■= \(pa){(pa\- D 

We next show the data processing inequality for the x-Quantity. 

Theorem 4.4 (Data processing inequality) We define 

X""'' ■.= SiSip))-J2pi^)Si£iPa)), (98) 

a 

where £ : L{Hs) L{Hs') is a CPTP map. Then 

X'^'' < X^'- (99) 

Proof. From Eq. ^ and inequality ([77D, we have x'^^' = I^'-^' {{Xa ® £){p^^)) < 
jA:S(^pAs^ ^ ^As^ where Ia is the identity on L{Ha)- □ 

We next formulate and prove the Holevo bound, which determines the upper 
bound of the accessible classical information that is encoded in a quantum system. 
We consider that the classical information about a & A is encoded in a quantum 
state Pa G Q{Hs)- We extract the information about a by performing a quantum 
measurement on the quantum system. 

Let {-EfcjbgB be a POVM with a finite set B. The probability of obtaining outcome 
6 e 5 by a measurement on state pa with the POVM is given by 

p{b\a) = tT[E,pa]. (100) 

The joint probability of (a, b) is p{a, h) = p{b\a)p{a), whose marginal distributions are 
p{a) := XlfePl'^'^) Pi^) ■~ SaPl*^'^)- The mutual information I^'^ between the 
two classical variables is then given by 



'^^^-Efio^i-m- (101) 



a,b 



The Holevo bound states that the upper bound of the classical mutual informa- 
tion fllOip is bounded by the x-quantity. 
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Theorem 4.5 (Holevo bound) 

J^^^ < x^"" (102) 

holds for any POVM {Eb}beB- 

We will discuss two proofs of Theorem 4.5 in the followings. The first proof [T^ is 
more intuitive than the second one [7T] , while the second one is simpler than the first 
one. Both proofs are based on the monotonicity of the quantum relative entropy. 

Proof. We now discuss the first proof. We introduce two auxiliary quantum systems 
described by Hilbert spaces Ha and Hb- Their orthonormal bases are labeled by the 
corresponding classical variables as {|a)}agA C Ha and {\b)}b£B C Hb- We define 
p^^ e Q{Ha ® Hs) as 

p^^ ■.= ^p{a)\a){a\®pa, (103) 

a 

and define p^^^ := p^"^ ® |0)(0| G Q{Ha ® Hg ® Hb), where |0) G Hb is an 
initial reference state. It is easy to show that there is a CPTP map £sb acting on 
L{Hs ® Hb) such that 

{Xa®£sb){p^''') = 5^p(a)|a)(a| ® ^bPa^b®\h){h\ =: p'^^^, (104) 

a,b 

where Xa is the identity on L{Ha)- We note that £sb describes a measurement 
process corresponding to POVM {Ea}. 
Let p'^-^ := tr5[p^'^^]. We then obtain 

^AS ^ jA:S^pAS^ ^ jA-.SB^pASB^ > jA.SB ^^^ASB^ > /^^^(p'^^), (105) 

where we used Eq. (1551) and the data processing inequalities (1771) and (179]) . By noting 
that 

p'^"" = Y.p{aMa){a\®\h){hl (106) 

a,b 

we obtain 

jA-B^p,AB^ = JA-B^ (107) 

where the rhs means the classical mutual information fllOll) . Inequality fll05p and 
Eq. (fTUTD imply the Holevo bound (fT02|) . □ 

Proof. We next discuss the second proof. We note that the x-quantity can be 
written as 

X-^^:=$^p(a)^(Pa||p). (108) 

a 

Let < oo be the number of possible outcomes, and let p^, p' G Q{C^) be diagonal 
matrices, where every diagonal element is given by ir[Ebpc^ or tr[i?bp], respectively. 
We have 

I^--^ = Y.p{a)S{p:\\p'). (109) 

a 
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On the other hand, a hnear map £ : L{Hs) L{C^) satisfying £{pa) = p'a and 
S{p) = p' is CPTP. Therefore, from the monotonicity of the quantum relative entropy, 
we obtain 

S{pa\\p) > Sip'Jp') (110) 

for any a. By averaging inequahty f lllOp over all a's and by using Eqs. fllOSp and 
ffTOQl) . we obtain the Holevo bound flT02l) . □ 



4.3 QC-mutual Information (Groenewold-Ozawa Information) 

We next introduce a quantity called the QC-mutual information that is also related 
to the accessible classical information encoded in quantum states [73]475| [83]. We 
consider a quantum measurement described by POVM {i^fej^gs, where B = {b} is 
the finite set of outcomes. If the measured state is p G Q{Hs), the probability of 
obtaining outcome b is given by p{b) = tislEbp]. By defining 

P6:=^V^PV^, (111) 

we introduce the QC-mutual information as follows. 

Definition 4.3 In the above setup, the QC-mutual information (the Groenewold- 
Ozawa information) is defined as 

lSi:=S{p)-J2pimPb). (112) 

b 

We note that the QC-mutual information f lll2p only depends on the measured 
state p and the POVM {Eh}beB- We note that Groenewold [TS] and Ozawa [71] 
originally discussed the case that any Kraus operator Mb G L{Hs) satisfies Eb 
MlMb. 

Remark 4.4 We note the relationship between the QC-mutual information and the 
classical mutual information. Let {|(/3a)}aeA be an orthonormal basis of Hs- We as- 
sume that p and EbS are simultaneously diagonalized such that p = J2aP(^)\'^a) {Va\ 
andEb = ^aPi^\^)\^a){^a\ for any 6 G B. In this case, we have p,, = Y.aPi^\^)\Va) {'^al 
with p(a|6) := p{b\a)p{a) / (^aP(&|a)p(a))- Therefore, we obtain 

I^i = I^-^ (113) 

where I^'^ is the classical mutual information between A and B for the joint distri- 
bution p(a, b) := p{b\a)p{a). 

We consider a quantum measurement with the set of Kraus operators {Mb}b£B C 
L{Hs) satisfying Eb = M^Mb. Let p', := MbpMl/p{b) and p' := Y.bPi^)p'b- The 
QC-mutual information can then be written as 

^QC = X'"" - A^„.eas, (114) 
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where '■= S{p') — '^t,pib)S{p[) is the x-Quantity of the post-measurement states, 
and AS'meas := S{p') — S{p) is the change of the von Neumann entropy by the mea- 
surement. 

On the other hand, the QC-mutual information Iqq equals the X"Quantity of 
an auxihary system. Let Hf> be the Hilbert space of the auxihary system R, and 
|\E') G Hs ® Hfi be a purification of p such that 

trnm{n=P- (115) 

We define p^ := tr5[|^)(^|] and 

pf■.= tIs[i^/E^®I''mm^/E^®I'')]/p{b), (116) 

where G L{Hr) is the identity. We note that p^ = J2bPi^)Pb^- then obtain 
the following theorem: 

Theorem 4.6 The QC-mutual information satisfies 

/gc^ = X^^ (117) 
where '■= S{p^) — J2bPi^)Pb' the x-Quantity of {p^}b£B- 

Proof. By noting that trij[(v/^®/^)|^)(^|(v^®^^)]/p(&) = Pb, we have ^(pf,) = 
5(pf ) and S{p) = S{p^), which imply Eq. I^Hi)- □ 

Therefore, the QC-mutual information satisfies the following inequality. 
Corollary 4.3 

< I^^ < Hip), (118) 

where H{p) := — '}2b'PiP) lnp(6). 

Proof. Apply inequalities f lSB]) and f lHS]) to x^^- ^ 

We next consider an information-theoretic meaning of the QC-mutual information. 
We assume that classical information about a G A is encoded in p G Q{Hs) as 

p = J]g(a)p„ (119) 

a 

where p^'s are density operators, g(a)'s satisfy J2al(^) ~ 1' ^ assumed to be 
a finite set. We then perform a measurement with POVM {Ei,}h(zB- The probability 
of obtaining b under the condition of a is given by 

p{b\a) = tislEbPa]. (120) 

The joint distribution is p{a,b) := p{b\a)q{a). We note that the unconditional prob- 
ability of obtaining b is given by p{b) = J2aPi^'^) ~ ^'''s[Eap], and the QC-mutual 
information is defined by Eq. f lll2p with p^ := y/RhP^/Eb / p{b) ■ We then have the 
following theorem as a "dual" of the Holevo bound (Theorem 4.5) ^75] . 
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Theorem 4.7 In the above setup, the classical mutual information I^'^ between A 
and B is bounded by the QC-mutual information: 

I^-^ < I'^i. (121) 

Proof. Let pa = X]j Q'a(OlV'ai)(V'aj| be the spectrum decomposition of p^, where 
{|^ai)}i is an orthonormal basis of Hs- We introduce an auxiliary system described 
by Hr with orthonormal basis {\rai)}ai, and define a purification of p: 

m:=Y,^q{ajU^\^at)\rai) e Hs ^ Hr. (122) 

at 

We have p = tri?[|^')(^|] and = tTR[I^®P^\^){<i/\]/q{a), where := ^. \rai){rai\ 
and is the identity on Hs- On the other hand, we define p^ := tr5[|^') (\E'|] and 
pj^ := tr5[(/E^® /«)|^)(^|(v/E^® /^)]/p(6) = tTs[{E, (g) /^)|^)(^|]/p(6), where 
is the identity on Hr. By noting that 

trs [E.tiR [I' ® P,^|^)(^|]] = tr^ [Pi^tr^ [E, ® /^|^)(^|]] , (123) 

we obtain 

tTs[E,pa]qia) = tr«[PMp(6), (124) 

and therefore 

pia,b)= ti R[P^^p^]p{b). (125) 

Since Iqq = S{p^) — '^i,p{b)S{pl^) holds from Theorem 4.6, we obtain inequal- 
ity (1121 p by applying the Holevo bound to {pf jfegs- □ 

We note that, in the set up of the Holevo bound (Theorem 4.5), the encoding of 
the classical information is fixed and the measurement is arbitrary. In contrast, in 
the setup of Theorem 4.7, the encoding is arbitrary and the measurement is fixed. 
Inequality f ll2ip determines the upper bound of the accessible classical information 
under the condition that the measurement is given by {Ef,}h^B and the ensemble 
average of the encoded states is given by p. 

5 Second Law of Thermodynamics 

We now derive the second law of thermodynamics in three manners. The first deriva- 
tion is based on the positivity of the quantum relative entropy. The second derivation 
is based on the quantum fluctuation theorem, which is shown to be equivalent to the 
first derivation. The third one is based on the monotonicity of the quantum relative 
entropy. 
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5.1 Thermodynamic Entropy and the von Neumann Entropy 

Before going to the main part of this section, we briefly discuss the relationship 
between the thermodynamic entropy and the von Neumann entropy. We consider a 
thermodynamic system that is in thermal equilibrium. Let H be the Hamiltonian of 
the system. The Helmholtz free energy is defined as 

F ■=-(5-^\nii[e-^% (126) 

where /3 > is the inverse temperature of the system. The thermodynamic entropy 
5'therm satisfies 

5thcrm = /3((^)-F), (127) 

where {E) is the average energy of the system. The thermodynamic relation fll27p has 
been well established from the 19th century as a phenomenological thermodynamic 
relation for macroscopic systems. In terms of statistical mechanics, however, it is not 
so obvious to determine the microscopic expression of the thermodynamic entropy 
5'therm as wiU bc discussed in Sec. 7. 

As a special case, if we select the canonical distribution 

Pcan := e^(^-^) (128) 

as a microscopic expression of the thermal equilibrium state, we can easily show that 
the thermodynamic entropy is given by the von Neumann entropy of the system: 

Theorem 5.1 The von Neumann entropy of pcan satisfies 

^(Pcan) =/3((^)can-F), (129) 

where (-E)can := tr [//pcan] is the average energy in the canonical distribution. 

The statistical-mechanical relation fll29p is consistent with the thermodynamic 
relation f ll27p with the correspondence between S'(pcan) and S'therm- Therefore, in 
the main part of this article, we will identify a canonical distribution to a thermal 
equilibrium state, and the von Neumann entropy to the thermodynamic entropy; these 
identifications have been widely used in statistical mechanics. Some subtle points on 
the vahdities of these identifications will be discussed in Sec. 7. 

We can also easily calculate the quantum relative entropy of any state p G Q{H) 
to the canonical distribution pcan as 

5(p||Pcan)=/3(F-(i^))-5(p), (130) 

where {E) := tT[Hp] is the average energy of p. From the positivity of the quantum 
relative entropy, we have 

S{p) < ^{F - {E)), (131) 

where the equality is achieved only if p = pcan- Inequality fll3ip implies that the von 
Neumann entropy takes the maximum in the canonical distribution among the states 
that have the same amount of the energy. 
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5.2 From the Positivity of the Quantum Relative Entropy 

We now derive the second law of thermodynamics based on the positivity of the 
quantum relative entropy and the unitarity of the time evolution of the system [33|l57] . 
We first prove a very general but almost trivial equality and inequality, and next apply 
them to thermodynamic situations. Therefore, the nontrivial part of this subsection 
is in the applications to each examples. 

We consider a unitary evolution of the system from state p\ G Q{H) to pf G Q{H) 
such that Pf = UpiW. We also introduce a reference state po G Q{H), which is 
different from pi or pf in general. Since S{pi) = S'(pf) holds due to the unitary 
evolution, we have the following theorem: 

Theorem 5.2 In the above setup, 

-tr[pflnpo]-5(pi) = ^(pf||po) (132) 

holds, which leads to 

-tr[pflnpo]-5(pi) >0. (133) 

We note that we use the positivity of the relative entropy S{pi\\po) to derive 
inequahty (11331) . While Eq. (11321) and inequality (I133P are obvious, they play key 
roles to derive the second law of thermodynamics as shown below. We discuss two 
typical situations in the following. 

Example 5.1 We assume that the system is driven by a time-dependent Hamiltonian 
H(t) from t = to t = T, which gives the unitary operator as 

[/ = Texp (^-i^ H{t)d?j . (134) 

The free energy corresponding to the Hamiltonian at time t is given by 

F(t) := -/3-Mntr [e-^^W] , (135) 

where /3 > 0. We assume that the system is initially in the canonical distribution at 
inverse temperature (3 such that 

p.,= eP(m-Hm^ (136) 

and define the reference state as 

p^:=eP(nr)-mr))_ (137) 

We stress that pf ^ po in general. In this setup, we have 

tr [pf In Po] = (3 {Fir) - tr [Hpf]) . (138) 
Therefore, Eq. fll32p reduces to 

P{{W)-AF) = S{p,\\po), (139) 
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where 

{W) :=tr[pfi7(r)]-tr[pii7(0)] (140) 
is the energy difference of the system, and 

AF := F(r) - F(0) (141) 

is the free-energy difference corresponding to the initial and final Hamiltonians. We 
note that the energy difference (W) is regarded as the work performed on the system 
in this setup, because any heat bath is not attached to the system. Corresponding to 
inequality fll33p . we obtain the second law of thermodynamics 

{W) > AF. (142) 

Example 5.2 We assume that the total system consists of the main system S and 
heat baths Bk {k = 1,2,---). The Hilbert spaces corresponding to S and Bk are 
respectively given by Hs and Hb, so that H = Hs ®k Hb,. Let e L{Hs) 
be the system's Hamiltonian, H^'' G L{Hb^) be the /cth Bath's Hamiltonian, and 
jjSBk ^ L{Hs^ Hsf.) be the interaction Hamiltonian between S and B^. We assume 
that and H^^'= are time-dependent, while H^'= is time- independent. The total 
system then obeys a unitary evolution from t = to t = t corresponding to the total 
Hamiltonian 

H{t) = H^{t) + J^iH^^'it) + H^^), (143) 

k 

where we omitted to write the tensor products with the identities on Hs and i/^^'s. 
Let H^^'^it) := Ylk ^^^''i^)- -^^^ simplicity, we assume that the interaction Hamilto- 
nian satisfies H'"'\0) = H''''\t) = 0. 

Let be the canonical distribution corresponding to H^'' such that 

Pfa^n := e^^(^^-^''\ (144) 
where /3fc > is the inverse temperature of Bk, and 

Ffc:=-/3-Mntr[e-^^"^] . (145) 
We assume that the initial state of the total system is given by a product state 

Pi :=pf ®fcPfan, (146) 

where pf is an arbitrary initial state of the system. We note that Eq. fll46p is consis- 
tent with assumption if™*(0) = 0. The final state is given by pf = UpiW , where U is 
given by Eq. (11341) with the total Hamiltonian f ll43p . The final state of S is given by 

pf ■.= tTB[pf], (147) 

where tre means the trace over all Hb,.'s. We then define the reference state as 

Po ■■= pf pfin- (148) 
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In this setup, we can show that Eq. fll32p is equivalent to 

AS-J2l^k{Qk)=S{pi\\po), (149) 

k 

where 

AS := Sipf) - 5(pf ) (150) 
is the difference in the von Neumann entropy of the system, and 

{Qk) := tr [if^Vi] - tr [//^''Pf] (151) 

is regarded as the heat that is absorbed by S from bath Bk due to assumption 
H^"^{0) = if™*(r) = 0. Corresponding to inequahty (11331) . we obtain the Clausius 
inequahty 

AS-J2Pk{Qk)>0. (152) 

k 

We note that, in the conventional thermodynamics, the initial and final states of the 
system are assumed to be in thermal equihbrium. On the other hand, we assumed 
nothing on pf and pf above. Therefore, inequahty fll52p is regarded as a generahzation 
of the conventional Clausius inequality to situations in which the initial and final 
states of the system are out of equihbrium. 

In the following, we additionally assume that the initial state of the system is 
given by the canonical distribution at inverse temperature /3 such that 

pS ^^PiFSiO)-HSiO))^ (153) 

where 

F'{t) := -r'tr [e-'^^'W] . (154) 

This assumption is consistent with assumption i7™*(0) = 0. By using notation p^ := 
g/3(FS(r)-ffS(r))^ we obtain 

Sipf) < -tr [pf Inpo^] = (3 (F(r) - tr [H^pf]) , (155) 
where we used the positivity of S'(pf ||pq). Therefore, we obtain 

(A^-^) - AF^ > AS, (156) 

where 

(AE^) := tr[pf i7^(r)] - tr[pfi7^(0)] (157) 

is the energy difference of the system. By combining inequalities fll52p and fll56p . we 
obtain 

(3 ((Ai^;^) - AF^) - J2 Pk{Qk) > 0. (158) 

k 
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For a special case in which there is a single heat bath at inverse temperature 
inequality fll33p reduces to 

(W) > AF^, (159) 

where 

{W) := (E^) - (Q) (160) 
is the work performed on the system. 

The argument in this subsection is based on the positivity of the quantum relative 
entropy and the unitary evolution of the total system. We can replace the unitary 
evolution by a unital CPTP map S satisfying S{I) = I. In this case, the von Neumann 
entropy of the total system is non-decreasing as S{pi) < S{pf), which has been shown 
in Corollary 3.2. Thus, Eq. fll32p is replaced by an inequality 

-tr[pflnpo]-5(A)>5(pf||po), (161) 

and therefore, inequality 01331) remains unchanged. As a consequence, inequali- 
ties (11421) and 01521) still hold for such a CPTP map S acting on the total system. 



5.3 Prom the Quantum Fluctuation Theorem 

The quantum fluctuation theorem gives information about fluctuations of the entropy 
production P5H5^ . We first introduce the stochastic entropy production and formu- 
late the quantum fiuctuation theorem in a very general setup. Let Pi,po G Q{H) 
be density operators. They have spectrum decompositions p\ = J2aPii^)\'^a){i^a\ 
and po = ^fePo(^)|06)(0fe|5 where {|'?/'a)} and {|0b)} are orthonormal basis of H. To 
formulate the quantum fiuctuation theorem, the key concepts are the forward and 
backward processes that are described as follows. 

Forward process. In the forward process, the initial state is given by pi. We first 
perform the projection measurement on pi with basis {iV'a)}) and obtain outcome 
a with probability Pi{a). By this measurement, the ensemble average of the post- 
measurement states equals p;. We next perform a unitary operation with a time- 
dependent Hamiltonian H{t) from t = to r. The unitary operator is given by 
Eq. f ll34p . The density operator of the system then becomes pf = U p-JJ^ . We next 
perform the projection measurement on pf with basis {|0b)}, and obtain outcome h 
with probability Pf(fe) := {(t)b\Pi\<t'b) ■ The joint probability of (a, 6) is given by 

p(a, h) := pip <— a)pi{a), (162) 

where 

pib^a) := |(0fe|f/|^a)P (163) 
is the transition probability. We note that Pf(6) = ^^aPi^'^)- 
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Backward process. To formulate the backward process, we need to introduce the 
time-reversal operator acting on H, which is an anti- unitary (i.e., inner-product 
preserving and anti-linear) operator satisfying 0^ = and 0^ = 0. Here, an anti- 
linear operator satisfies that, for any Ifi), \ip2) € H and ai,a2 G C, 

0(ai|(^i) + a2\ip2)) = aie\(pi) + a*e\(p2), (164) 

where a* means the complex conjugate of aj. Let \ipa) '■— ©iV'a), l^fe) '■— ©l^b), 
and po := QpoQ — '^i,Po{b)\^b) {^b\- We note that {ji^a)} and{|06)} are orthonormal 
bases of H. 

The protocol for the backward process is as follows. The initial state of the 
backward process is given by pq. We first perform the projection measurement on po 
with basis {l^b)}, and obtain outcome b with probability PQ{b). By this measurement, 
the ensemble average of the post-measurement states equals po. We introduce the 
time-reversal of the Hamiltonian as 

H(t) := eH{t)e. (165) 

For example, if the Hamiltonian depends on magnetic field B as H(t;B), its time- 
reversal is given by H{t]B) = H{t]—B). We next perform a unitary operation 
from t — to t — T with the time-reversed control protocol of the time-reversed 
Hamiltonian. The corresponding unitary operator U is given by 

U -.^T exp (^-i H{t - t)d?j . (166) 

We next perform the projection measurement on U PqU^ with basis {]'?/'«)}) and obtain 
outcome a with probability Pf(a) := {''pa\UpoU'^\'ipa)- The joint probability of (6, a) in 
the backward process, denoted by p(6, a), is then given by 

p{b,a)^p{a^b)po{b), (167) 

where 

p{a^b)^\{i>amb)\^ (168) 
is the backward transition probability. We note that Pi{a) :— P(^) 



We define the following quantity: 

a(a,6):=ln?^, (169) 

which is referred to as the stochastic entropy production in the forward process. The 
average of the entropy production is given by 
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which is positive because of the positivity of the classical relative entropy: 

(a) > 0. (171) 

We discuss the relationship between inequality f ll7ip and inequality fll33p in The- 
orem 5.2 in Sec. 5.2. The following theorem plays a key role. 

Theorem 5.3 The classical relative entropy f ll70p can be written as 

(a) = 5(pf||po), (172) 

where ^(pfllpo) is the quantum relative entropy. 

Proof. We first note that 6^6 = W holds, because e(i^(t))0 = -iH{t) holds. 
We then have the key observation that the unitary evolution has a time-reversal 
symmetry: 



= \{^|J,\U^<p,)\' = mU\^Pa)\'=p{b^a). 



(173) 



Therefore, we obtain 



which leads to 



a(a,6) = ln^, (174) 



(a) = J]p(a, 6) In ^ = Y^pM) InPi(a) - $^Pf(&) lnpo(&). (175) 



a,b 



Obviously, 



^Pi(a) Inpi(a) = -S{pO = -S{pi). (176) 



We also obtain 



Ypf{b)\npo{b) = ^{(pblpM lnpo(&) 

b b 

= X^(0f>|Pf Inpol^fo) =tr[pflnpo]- 



(177) 



b 

By combining Eqs. (I176p and ( I177p . we obtain Eq. fll72p . □ 
From Eqs. f ll32p and fll69p . we obtain 

(a) = -tr[pflnpo]-5(pi). (178) 

Therefore, inequality (11711) is equivalent to inequality f ll33p . Since inequality fll33j) 
leads to inequalities (11420 and (11520 as special cases, these inequalities can also be 
regarded as special cases of inequality (I17ip . which we will discuss in detail later. 



34 



Equality ( 11691) is regarded as a general expression of the quantum fluctuation 
theorem. The reason why Eq. fll69p can be called a "theorem" rather than just a 
definition lies in the fact that a equals to some important thermodynamic quantities 
for special cases, as we will show in Examples 5.3 and 5.4. Strictly speaking, Eq. (11691) 
should be called a theorem only for such cases with the thermodynamic expressions 
of a. In fact, we have already shown Eq. fll78p . which implies that the average of o 
reduces to the Ihs's of ffT^ and ffT^ . 

Before going to such special cases, we discuss the some properties of o based on 
Eq. fll69p . To do so, we introduce the entropy production in the backward process as 

a(6,a):=ln^. (179) 

In the backward process, 6po0 and 6pi6 respectively play the roles of p\ and po in 
the forward process. Therefore, definition (I179p in the backward process is consistent 
with the definition (I170p in the forward process. We note that 

a(a,6) = -CT(6,a). (180) 

We introduce the probability distribution of a as 

p(a = S) := 5^p(a, 6)5(S, a(a, 6)), (181) 

a, 6 

where 5(-, ■) is the Kronecker delta, and that of a as 

V{p = S) := 5^p(6, a)5(S, a{b, a)). (182) 



b,a 



We can show that 



because 



p(a - -^j 



p{a = S) 



e"^, (183) 



p{a = -S) = ^p(6, a)5(-S, a{b, a)) 

a,b 

= 5^p(a,6)e^(''''^)5(-S,a(6,a)) 

a,b 

= e-^ ^)'^(-^' ^(^' «)) ^^^^^ 

a, 6 
a,b 

= e-^p{a = S). 

We also refer to Eq. fll83p as the quantum fluctuation theorem. We can show that 

(e-'^) = 1, (185) 
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because 

(e-'^) := = S)e-^ = ^pia = -S) = 1. (186) 

s s 

Equality fll85p is called the integral fluctuation theorem or the quantum Jarzynski 
equality. By using the Jensen inequality for the exponential function (i.e., e"^*^^ < 
{e^")), we reproduce inequality f ll7ip from Eq. ( I185p . We note that the quantum 
fluctuation theorems f ll69p . f ll83p . and f ll85p were obtained by Kurchan [32] and 
Tasaki 133). 



Example 5.3 We consider the case of Example 5.1 in which p\ and po are given 
by Eqs. ( 11361) and ffT37p . respectively. Let H{Q) = Ea ^a(0)|^a)(^/'a| and H{t) = 
E(,£^fe(r)|0fe)(0fe| be the spectrum decompositions of the initial and final Hamiltoni- 
ans, which leads to Pi{a) = e^(^(°)"^"(°)) and poib) = e^(^(^)--^''(^)). The stochastic 
entropy production (ll7Up is then given by 

a{a, b)=\n^ = /3{AF - W{a, b)), (187) 
Po[b) 

where AF := F(r) - F(0) and W{a,b) := Eb(r) - ^^(0). We then obtain 

(W) := 5^p(a, b)W{a, b) = ti[H{r)p,] - tr[i/(0)A], (188) 

a,b 

which is consistent with Eq. fll40p . In this case, the integral fluctuation theorem fll85p 
reduces to 

{e-^W) = e-^\ (189) 

which is called the quantum Jarzynski equality. Inequality fll7ip reduces to fll42p in 
this situation. 

Example 5.4 We next consider the case of Example 5.2 in which pi and po are 
given by Eqs. fll46p and fll48p . respectively. Let pf := X^a' pf and pf := 

Y^h' Pi i^')\'^h>) {'t'b>\ tie the spectrum decompositions of the initial and flnal states of 
5*. The spectrum decompositions of the Hamiltonians of the heat baths are given by 
H'''' = Ea, K'MM- We then have 

Pi(a) = pfia')l[expi(3kiFk - <'^-)), 

J" (190) 
Po{b) = pf{b')l[exp{/3k{F, - El')), 

k 

where a = (a', {a^}) and b = {b',{bk}). We note that \ipa) = 1"^^/) ^k Ifa,,) and 
= li^b') ®fc Iv^ftfc)- Therefore, the stochastic entropy production is given by 

a{a, b) = st{b') - Si(a') - f^kQk{ak, bk), (191) 
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where 

Si(a') := -lnpf(a'), Sf(6') := -lnpf(6') (192) 
are called the stochastic entropies of S, and 

QkiatM) ■■= E^l: - (193) 

is the heat absorbed by S from Bk- We note that 

(.i) := Y.p{aMs,W) = -5^pf(a')lnpf(a') = 5(pf), 

;i ^ (194) 

(.f) := ^p(a,6).f(6') = -Y,vKb')\npl{b') = 5(pf), 

and 

{Qk) := J]p(a, 6)Qfc(afc, h) = tr [ff^'=pi] - tr [H^'^pi] , (195) 

which is consistent with Eq. f llSip . Therefore, we obtain 

{a) = AS-J2l^k{Qk), (196) 

k 

where AS := S{pf) — S{pf). Inequality ( 11711) reduces to the Clausius inequality (11521) 
in this situation. 

We note that the quantum fluctuation theorem has been generalized to nonunitary 
processes including quantum measurements j51j . 

5.4 From the Monotonicity of the Quantum Relative En- 
tropy 

We next apply the monotonicity of the quantum relative entropy to a derivation of the 
second law of thermodynamics. While the obtained inequalities are mathematically 
not equivalent to the inequalities obtained in the previous two sections, their physical 
meanings are the same for special cases. The inequalities in this subsection can also 
be applied to transitions between nonequilibrium steady states, which leads to a 
quantum version of the Hatano-Sasa inequality |62J. 

5.4.1 Time-independent Control 

We flrst consider relaxation processes to steady states, in which the external param- 
eters that we can control do not depend on time. The following theorem plays a key 
role. 
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Theorem 5.4 Let £ : L{H) L{H) be a CPTP map and p; G Q{H) be an initial 

state. We assume that S has a unique steady state satisfying £{pss) = Pss- Then 

AS > -ag, (197) 

where AS" := S{£{pi)) — S{pi) and 

aZ := tr[£:(pi) InpJ - tr[pilnpj. (198) 

Proof. Inequahty (11971) is obvious from the monotonicity of the relative entropy: 
•^(pillpss) > 5'(£:(pi)||£:(pss)) with S{pss) = Pss- □ 

We apply inequality (I197p to relaxation processes to equilibrium states. 

Example 5.5 Suppose that pss = I/d holds, where / is the identity on H and d 
is the dimension of H. In this case, = holds, and therefore inequality (11971) 
reduces to 

AS > 0, (199) 



which is equivalent to inequality (I66i) . 

Physically, the condition of S{I) = I implies that the steady state is the micro- 
canonical distribution. In fact, the microcanonical distribution is given by Pmicrocan : = 
I/d, where H is taken as the set of state vectors in a microcanonical energy shell. 
Thus, inequality (11991) is regarded as the law of entropy increase for adiabatic pro- 
cesses, in which the system does not exchange the energy with the environment so 
that the steady state is expected to be the microcanonical distribution. 

Example 5.6 Suppose that pss = e^^^"^-* holds, where H is the Hamiltonian and 
F := — lntr[e~^^] is the corresponding free energy. In this case, is given by 

ag = -PitT[H£ip;)] - tT[Hp,]) = -/3(Q), (200) 

where Q is the heat absorbed by the system. Therefore, inequality (11971) reduces 

to [S21E3] 

AS > (3Q, (201) 

which is the Clausius inequality with a single heat bath. 

We note that inequality (I197P can be applied to situations in which pss describes 
a nonequilibrium steady state (NESS). However, in such a case, the physical meaning 
of ag is not so clear. For the case of a classical overdamped Langevin system, Hatano 
and Sasa [M] showed that — o"^ can be regarded as an "excess heat," which is obtained 
by subtracting a "housekeeping heat" from the total heat. The housekeeping heat 
means the heat current in a NESS, which vanishes for the case of an equilibrium 
steady state. 
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5.4.2 Time-dependent Control 

We next consider situations in which we drive a system by changing external param- 
eters. We assume that we change the values of the parameters — 1 times during 
the entire time evolution. In such a case, the total time evolution £ can be written 

as 

^ = O ■ ■ • 0^2 o^i, (202) 

where £n '■ L{H) — L{H) {n = 1,2, ■ ■ ■ , N) describes the time evolution with the 
nth values of the external parameters. We assume that each Sn has a unique steady 
state Pss,n- We write pn '■= Sn{Pn-i) and po '■= Pi, where pi G Q{H) is the initial 
state. From inequality ( 11971) . we have a set of inequalities 

5'(p„+i) - 5'(p„) > - (tr[p„lnpss,n] - tr[p„_i lnpss,„]) {l<n<N). (203) 

By summing up them, we have the following theorem. 

Theorem 5.5 (Quantum Hatano-Sasa inequality) 

AS > -al, (204) 

where 

TV 

•= (^^[Z'" lnpss,n] - tr[p„_i lnpss,„]) . (205) 

n=l 

If Pss,n's are out of equilibrium, inequality (12041) is regarded as a quantum version 
of the Hatano-Sasa inequality, which was obtained by Yukawa [62]. We note that 
Eq. (12051) can be rewritten as 

iV-l 

= tr[pArlnpss,Ar] - tr [po In Pss,i] - ^ (tr[p„ lnpss,„+i] - tr[p„ Inp^sJ) . (206) 

n=l 

If the initial and final states are the steady states such that po = Pss,i and pat = 
Pss,iv, we have cr^ = -AS - EJ^i^ (tr[Pn lnpss,n+i] - tr[p„ lnpss,„]). In this case, 
inequality (I197p reduces to 

Af-l 

(tr[p„lnpss,„+i] - tr[p„lnpss,„]) < 0. (207) 

n=l 

If Pss.n's are equilibrium states, we again have the second law of thermodynamics as 
follows. 

Example 5.7 Suppose that pss,n = e^^^"~-^"^ holds, where Hn is the Hamiltonian 
during £n and F„ := — lntr[e~^^"] is the corresponding free energy. In this case, 
cr^ is given by 

N 

^o. = -P J](tr[i/„,p„] - ti[HnPn-i]) =■■ -f3{Q), (208) 

n=l 
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where (Q) is the heat absorbed by the system. Therefore, inequahty (11971) again 
reduces to 

AS > I3{Q). (209) 

We note that inequahty f l209p is not equivalent to inequahty fll52p with a single heat 
bath, because their setting are mathematically different. However, their physical 
meanings are physically the same; the both describe the Clausius inequality in the 
presence of a single heat bath, where entropy S is identified to the von Neumann 
entropy. We note that 

N-l 

Pss,n+1 ] - tr[p„lnpss,n]) 

n=i y^^^^) 

= (3{AF-{W)), 
where AF := Fjy — Fi is the free-energy difference, and 

N-l 

(W) := J2 i^ApnHn+l] - tT[pnH„]) (211) 



n=l 



is the work performed on the system. If the initial and final states are the canonical 
distributions such that po = Pss,i and pn = pss,N, inequality (12071) leads to 

{W) > AF. (212) 

We note that inequality (12121) still holds when the final state pn is out of equilibrium. 



6 Second Law with Quantum Feedback Control 

We next discuss a generalization of the second law of thermodynamics with quantum 
feedback control [761 - I9T] . based on the positivity and the monotonicity in parallel 
to Sec. 5. Here, quantum feedback control |106] means that the control protocol 
on a system depends on an outcome of a quantum measurement on the system. 
The obtained generalization includes the QC-mutual information. We note that any 
derivation based on the quantum fluctuation theorem has not been know so far except 
for a special case [871 [89], while the generalized second law with classical feedback 
control has been obtained from a generalized fluctuation theorem |114H119] . 



6.1 From the Positivity of the Quantum Relative Entropy 

We first derive a generalization of Theorem 5.2 with quantum feedback control. Let 
Pi G Q{H) be the initial state of the system. We first perform a unitary operation 
U so that the system evolves to p' := UpiW . We next perform a quantum measure- 
ment with the set of Kraus operators {Mbj^g^ C L{H), where B is the finite set of 
outcomes. The corresponding POVM {Eh}bi=B is assumed to be given by 

Eh = mIm,. (213) 
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The probability of outcome b is p{b) = tT[Ehp'], and the post-measurement state with 
outcome b is p'{b) := Mf,p'Ml/p{b). The QC-mutual information for this measurement 
is given by 

b 

We then perform a unitary operation Uf, that depends on outcome b, which is the 
feedback control. The final state with outcome b is pf (&) := Ubp'{b)Ul, whose ensemble 
average is pf := J^b^bMbUpiW mIuI- In this setup, Theorem 5.2 is generalized to 
the following theorem. 

Theorem 6.1 Let po{b) be a reference state corresponding to outcome b. Then 

- 5^p(6)tr[pf(&) lnpo(&)] - S{pO = 5^p(6)5(pf(&)l|Po(&)) - Iqc (215) 

b b 

holds, which leads to 

- 5^p(6)tr[pf(6) lnpo(6)] - S{p,) > -Iqc- (216) 

b 

Proof. We straightforwardly calculate that 

J2p(b)s{pm\Po{b)) 

b 

= -J2p(b) (Sipm + tr[pf(6) lnpo(6)]) 

b 

= - 5^p(6)tr[pf(&) lnpo(6)] - Sip,) + Sip,) - 5^p(6)5(pf(6)) (217) 

b b 

= - 5^p(6)tr[pf(&) Inpo(fe)] - Sip,) + Sip') - J2pib)Sip'ib)) 

b b 

= -J^PibMpiib) lnpo(6)] - Sip,) + /qc, 
b 

which implies Eq. (12151) . □ 

Example 6.1 We apply the setup of Example 5.1 to the above theorem. Let H, and 
Hfib) be the initial and final Hamiltonians, where the final one can depend on outcome 
b. We assume that the initial state is the canonical distribution p, = e^^^^~^'^ with 
F, := —f3~^ lntr[e~^''^'], and that the reference state is also the canonical distribution 
po(6) = e'^{Fi{b)-Hdb)) with Fiib) := -/3-Mntr[e-^^t(^)]. We then have 

-J2pib)tr[p,ib)\npoib)] - Sip,) = ^{W - AF), (218) 

b 

where 

(W) := 5^p(6)tr[i7f(6)pf(6)] - tT[H,p,] (219) 
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is the average of the work performed on the system, and 

{AF):=J2pib)F(ib)-F, (220) 

b 

is the average of the free-energy difference. Here, we assumed that the energy change 
of the system during the measurement is the work. Physically, this assumption implies 
that the measurement process is adiabatic. Inequality fl216p then reduces to 

P{W-AF) > -Jqc, (221) 

which is the generalization of inequality (I142p to feedback-controlled processes. 

Inequality ( 1221 p identifies the upper bound of the sum of the extractable work 
— (W) and the free-energy gain (AF) , which is proportional to the QC- mutual infor- 
mation as (3~^Iqc- In the classical limit, the upper bound is given by the classical 
mutual information. The equality in (122 ip is achieved by a classical model called the 
Szilard engine |12U] . where Iqc = In 2, (W) = (3~^ In 2, and (AF) = 0. Several models 
that achieves the equahty in (122 ip have been discussed for both classical |119yi21pl22] 
and quantum [H] regimes. We note that inequality (122 ip was first obtained by Sagawa 
and Ueda [83] . 

We note that, in a similar manner to the above example, we can apply the setup 
of Example. 5.2 to Theorem 6.1, and obtain 

AS-Y,f3k{Qk)>-lQc. (222) 

k 

6.2 From the Monotonicity of the Quantum Relative En- 
tropy 

We next derive a generalization of the quantum Hatano-Sasa inequality (I204p (Theo- 
rem 5.5) with quantum feedback control, based on the monotonicity of the quantum 
relative entropy. Let pi G Q{H) be the initial state. We control the system through 
external parameters without feedback. The system then evolves by a CPTP map 
■ ■ ■ o ^^2° ^1, where each {n = 1,2, ■ ■ ■ ,m) has a unique steady state Pss,n- We 
define 

m 

a£(l m) := ^ (tr[p„ In pss.n] - tr [p„_i In pss,n]) , (223) 

n=l 

where p„ := £n{Pn-i) and po := Pi. 

We next perform a measurement on p^ with instrument {^meas}fee_B, where B is the 
finite set of possible outcomes. The corresponding Kraus representation is assumed 
to be given by 

£^LiPm) ■■= M.p^Ml, (224) 

where M^'s are Kraus operators. We note that £^meas is trace-preserving and that 
Y^b^l^h = I holds with the identity / G L{H). The corresponding POVM {E^jbeB 
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satisfies Eb = mIMi, for any b. The probability of obtaining outcome b is p{b) = 
tT[EbPm], and the post-measurement state with outcome b is pm{b) := MbPmMl / p{b) . 
The QC-mutual information for this measurement is given by 

/qC = ^(Prn) - (225) 
b 

We define cx^ = during the measurement. Physically, this definition implies that 
the measurement process is assumed to be adiabatic, which is consistent with the 
assumption of the adiabaticity in Example 6.1. 

We next perform feedback control with a CPTP map S^'' o ■ ■ ■ o £^^^2 ° ^m+i that 
depends on outcome b. We assume that each CPTP map (n = m + 1, ■ ■ ■ , N) 
has a unique steady state pil]n- We define 

N 

aZim + l-,N;b):= (tr [p^ In p^J - tr [pi^l, In p^j) , (226) 

n=m+l 

where pn^ := Sn\pn~i) {n = m + 1, ■ ■ ■ ,N), and define 

aZ ■■= aZil -> m) + 5^p(6)af,(m + 1^N; b). (227) 

b 

We note that the ensemble average of the time evolution is given by 

S = Y,^N ° ■ ■ ■ ° ^nXl° ^^L° ^mO ■ ■ ■ O S20 (228) 
b 

which is CPTP. We write pf := S{pi) = ^bP^^)P^N ■ Then, the quantum Hatano-Sasa 
inequality fl204p is generalized to the following theorem. 

Theorem 6.2 In the above setup, 

Y.P^^^^^Pn) - SiPi) > -^Z - /qc (229) 

Proof. We have a set of inequalities from the monotonicity of the relative entropy: 
Sipn+i) - S{pn) > - (tr[p„lnpss,n] - tr [p„_i lu Pss,„] ) {l<n< m), (230) 

Sip^l) - S{p^^) > (tr[pW Inpgj - tr[pi^!, Inp^j) (m + 1 < n < iV). (231) 
By combining them to Eq. (12251) . we obtain inequality (12291) . □ 

Inequality (I229p has been obtained in this article for the first time. On the other 
hand, the classical Hatano-Sasa equality and inequality have been generalized to 
feedback-controlled classical systems ^SHHSD]- 
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Example 6.2 We apply the setup in Example 5.7 to Theorem 6.2. Let Hn be the 
Hamiltonian corresponding to Sn and := — lntr[e~^^"] be the free energy 
(1 < n < N). We note that Hn and F„ can depend on outcome b for n > m + 1, which 
are denoted as Hnib) and Fnip). We assume that the steady states are the canonical 
distributions: pss,n = e^^-^""-^") (n < m) and pss^n = e'^^^"^''^'^"^''^^ {n > m + 1). In 
this case, cr^ is given by 

m 

""c. = -(^Yl (^^l^npn] - tT[H„,Pn-l]) 

(232) 



n=l 

N 



-/^Ep(^) E MHn{b)p^{b)]-tr[Hn{b)pn-i{b)]) 

b n=m+l 

where (Q) is the average of the heat absorbed by the system. Here, we assumed that 
the measurement process is adiabatic such that (Q) = holds during the measure- 
ment, which is consistent with the definition = during f^mias. Inequality (12291) 
then reduces to 

(AS)>/3(g)-/Qc, (233) 

where (AS) := EbP(&)5(p!J^) " ^(Pi)- 

We further assume that the initial and final states are the canonical distributions 
such that pi = pss,i and p^^ = pfjj^- We then have 

{AS) = /3{AE - AF), (234) 

where 

{AE) := J2p(bMH^N^P^N^] - ^AHm] (235) 

6 

is the energy difference of the system, and (AF) := ^i,p{b)FN{b) — Fi. Therefore, 
we obtain from inequality (12331) that 

P{W - AF) > -Jqc, (236) 

where we defined (W) := (AE) — (Q) due to the first law of thermodynamics. While 
inequality (I236P is mathematically not equivalent to inequality (I22ip . their physical 
meanings are the same. We note that inequality (I236P still holds when p^^^s are out 
of equilibrium. 



7 Concluding Remarks 

We have discussed several second law-like inequalities based on the positivity and 
the monotonicity of the quantum relative entropy. In Sec. 2, we discussed the basic 
concepts for quantum states and dynamics. In particular, we discussed the complete 
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positivity of time evolutions, and derived the Kraus representation (12CT]) . In Sec. 3, we 
introduced the von Neumann entropy and the quantum relative entropy, and discussed 
their basic properties. In particular, we proved the positivity of the quantum relative 
entropy. We also discussed the monotonicity, which is proved in Appendix B. In 
Sec. 4, we have discussed the quantum mutual information and two related quantities. 
One is the Holevo's x-Quantity and the other is the QC-mutual information. The 
two quantities characterize the accessible classical information encoded in quantum 
states, through the Holevo bound fll02p and its dual inequality ( 11211) . We also showed 
the data processing inequalities (177|) and ( 198|) . which are direct consequences of the 
monotonicity of the quantum relative entropy. 

Sections 5 is devoted to the derivations of the second law of thermodynamics and 
its generalization. In Sec. 5.1, we discussed that the von Neumann entropy and the 
thermodynamic entropy can be identified in canonical distributions. In Sec. 5.2, we 
derived a general inequality fll33|) based on the positivity. Inequality fll33p leads to a 
well-known expression of the second law of thermodynamics f ll52p (i.e., the Clausius 
inequality) as a special case, by treating the total system including the baths as a uni- 
tary system. In Sec. 5.3, we discussed the quantum fluctuation theorem, which leads 
to the second law of thermodynamics fll7ip that is equivalent to inequality fll33p in 
Sec. 5.2. In Sec. 5.4, we derived a general inequahty fl204p based on the monotonicity, 
which is regarded as a quantum version of the Hatano-Sasa inequality. This inequality 
leads to the second law of thermodynamics fl209p for a special case. Our derivations 
are independent of the size of the system, and therefore the obtained inequalities can 
be applied to small thermodynamic systems such as quantum dots. 

We now discuss the physical meanings of the results in more detail. We first 
compare the two derivations of the second law, which are based on the positivity 
and the monotonicity. The derivation of inequality fll52p based on the positivity is 
quite universal, because we have essentially made only three assumptions: (i) the 
total system including the baths obeys a unitary evolution, (ii) the baths are initially 
in the canonical distributions, and (iii) the system is initially not correlated to the 
baths. We stress that we did not assume anything about the intermediate and the 
final states of the total system. The assumptions (i) and (ii) would be physically 
reasonable. The assumption (iii) can be justified if the system is initially separated 
from the baths. 

On the other hand, in the derivation of inequality (1201 p based on the monotonicity, 
any bath is not explicitly included in our formulation, and therefore time evolutions 
are assumed to be nonunitary. In this case, the derivation requires a strong assump- 
tion that (iv) the steady state of each CPTP map £n is a canonical distribution. We 
stress that the assumption (iv) is not satisfied in general if we trace out the bath from 
the total system satisfying (i)-(iii). Moreover, the derivation based on the monotonic- 
ity implicitly requires that (v) the outside bath needs to be refreshed (i.e., needs to 
be replaced by a new bath that is not correlated with the system) when the exter- 
nal parameters are changed, because each is assumed to be CPTP. On the other 
hand, the monotonicity can be apphed to the cases in which the steady states are 



45 



not in thermal equilibrium, which is an advantage of the method of the monotonic- 
ity. We note that the derivation based on the monotonicity can be generalized to 
continuous-time systems described by quantum master equations [63 |I106[IT23|I124] . 

Going back to the assumption (iii) for the derivation based on the positivity, 
it has been pointed out that if the system is initially correlated with the baths, 
inequality f ll52p needs modifications |125H127] . For example, in the special case of 
inequality fll59p . the free energy of the system needs to be renormalized [50] • On the 
other hand, the role of the initial correlation has been discussed in terms of the origin 
of the arrow of the time [T28l[T29] . 

The assumption (ii) can be criticized in terms of the foundation of statistical 
mechanics. In macroscopic systems, a thermal equilibrium state does not necessar- 
ily correspond to any canonical distribution from the microscopic point of view. In 
fact, it has been shown that even a pure state can behave as a thermal equilibrium 
state |130H138j . Therefore, the assumption (ii) is difficult to be rigorously justified 
for macroscopic baths. The identification of the von Neumann entropy to the ther- 
modynamic entropy fails in general, except for the cases in which a thermal equilib- 
rium state corresponds to a canonical distribution. Experimentally, trapped ultracold 
atoms can relax to a thermal equilibrium state even if they are well separated from 
the environment. We note that the detailed investigations of the relaxation processes 
of isolated quantum systems are now experimentally accessible [139] . 

We next briefly mention a theoretical approach to derive the second law with- 
out assuming the initial canonical distribution |14Uf[143] . Let p G Q{H) be an 
initial state. We consider a unitary evolution U from t = to r with a time- 
dependent Hamiltonian H{t). We define the work performed on the system as 
iy/) ■= ti[H{T)UpW] - ii[H{Q)p\. We also define the quasi-static work (ly)siow : = 
limT-_s.oo (W^) , where i^(0) and H{t) are fixed. Let i^(0) = Ylik {4^k\ be the 

spectrum decomposition of the initial Hamiltonian. The crucial assumption on the 
initial state is that p is written as p = X]A:P(^)l^fc)(^fc| with p{k)^s satisfying 

p{k)>p{k') if Ek<Ek>. (237) 

Then, Lenard [140] proved that 

{W) > (ly)siow, (238) 

which is regarded as an expression of the second law of thermodynamics (see also 
Ref. [142] ). We note that the quasi-static work can be identified to the free-energy 
difference as (W^)siow = 

Without assuming 02371) . inequality fl238p is not satisfied in general. For example, 
the microcanonical distribution does not satisfy assumption fl237p . if the entire Hilbert 
space is spanned by all \ipk)^s and is not restricted to the microcanonical energy 
shell. In fact, several counterexamples against inequahty (12381) have been discussed in 
classical systems with the microcanonical initial distribution |144H146] . We note that 
a generalized quantum fluctuation theorem for microcanonical initial distributions 
has been discussed [147j . 
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The role of a quantum coherence in the initial state has been studied in terms 
of quantum heat engines |148H154] . For example, if the initial state differs from the 
canonical distribution and involves an additional quantum coherence, the second law 
of thermodynamics like f l238p is not necessarily satisfied. 

We now reach an observation that the choice of the initial state is crucial to derive 
the second law. In fact, the canonical distribution is a special state that maximizes 
its von Neumann entropy among the states that have the same amount of the energy. 
This special initial condition is a crucial reason why we can derive the second law 
even if the dynamics of the total system is reversible. It would be difficult to rigor- 
ously justify the physical validity of the initial canonical distribution. Therefore, the 
derivations of the second law in Sec. 5 are not so satisfactory to understand the fun- 
damental reason why the macroscopic world is irreversible in spite of the reversibility 
of the microscopic dynamics. To understand the origin of the arrow of the time would 
be an interesting but quite difficult future challenge. 

On the other hand, thermodynamics of information processing is also an interest- 
ing topic. While it would be easier than the aforementioned fundamental problem, it 
is also closely related to the foundation of thermodynamics and statistical mechan- 
ics. This topic has been discussed by numerous researchers in terms of the paradox 
of "Maxwell's demon" |108H111] . The demon can be formulated as an information 
processing device, and has been shown to be consistent with the second law of ther- 
modynamics |110H113] . Recently, modern nonequilibrium statistical mechanics and 
quantum information theory have shed new light on the theory of thermodynamics 
including the demon [75mn [TIMlTm[m[IT^2j . 

In particular, we discussed thermodynamics of quantum feedback control in Sec. 6. 
We obtained inequalities fl216p and fl229p as generalizations of fll33p and f l204p . re- 
spectively. The obtained inequalities lead to generalizations of the second law fl22ip 
and ( 12331) . We note that the quantum fluctuation theorem with quantum information 
processing has been elusive except for special cases [SZIES]- 

A Short Summary of the Linear Algebra 

In this appendix, we briefly summarize the concepts and notations of the linear algebra 
for finite-dimensional Hilbert spaces. Let H he a. A^-dimensional Hilbert space with 

< oo. The inner product of \(f), |0) & H is written as (v^l^) or {if,4>). The norm 
of the Hilbert space is given by Hv^lP = (v^|v^)- A linear basis {\'^k)}k=i C H is called 
an orthonormal basis if it satisfies {(pkl'fii) = Sm with 6ki the Kronecker delta. 

Let L{H, H') be the set of linear operators from H to another Hilbert space 
H'. In particular, we write L{H) := L{H,H). We write {(f\X\(j)) := {(f,X(f)) for 
X G L[H). For any X G L{H, H'), there exists a unique operator G L{H', H) 
that satisfies 

(0,X^) = (XV,^) (239) 

for any G H and |0) G H' . is called the adjoint or the Hermitian conjugate 
operator of X. If = X holds for X G L{H), X is called self-adjoint or Hermitian. 
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We note that any operator X G L{H) can be written as X = Xi + 1X2, where 
Xi := (X + Xt)/2 and X2 := (X - Xt)/(2i) are Hermitian. 

Let H' be a subspace of H with dimension N' (< N). Let {\ipk)}k=i be an 
orthonormal basis of H'. The projection operator onto H' is given by 

N' 

Ph' = J2\^^)(^^\^^W- (240) 

A:=l 

In particular, any orthonormal basis satisfies 

N 

J2\V'')(^k\=I, (241) 

k=l 

where / is the identity operator on H. On the other hand, \^p){^p\ with H^^H = 1 is 
the projection operator onto the 1-dimensional space that is spanned by \ip). Any 
Hermitian operator X has the spectrum decomposition 

X = Y,^k\¥^k){¥^k\, (242) 

k 

where G M is an eigenvalue and {(fk) is an eigenvector that constitutes an orthonor- 
mal basis {|</?A;)} C H. The support of X is the subspace of H that is spanned by 
|</9fc)'s with nonzero eigenvalues. 

If {ip\X\(p) > holds for any \ip) G H, X G L{H) is called positive. Any positive 
operator is Hermitian, because {(p\X2\(p) should be zero for any \(p) with X2 defined 
above. Any eigenvalue of a positive operator is non-negative. We write X > F if 
X — y is positive. In particular, X > if X is positive. If {ip\X\(p) > holds for any 
nonzero \ip) G H, X G L{H) is called positive definite and is written as X > 0. Any 
positive-definite operator is Hermitian with positive eigenvalues. We note that the 
support of a positive-definite operator equals to H. We also note that V G L{H, H') 
is called a contraction, if ((^|y''V^|(^) < (v^lv^) holds for any \ip) G H. 

The trace of X G L{H) is defined as 

tr[X] := ^(<^,|X|<^fc), (243) 

k 

where {Iv^/t)} C i? is an orthonormal basis. We note that tr[X] is independent of 
the choice of the orthonormal basis. If the spectrum decomposition of X is given by 
Eq. fl2l2|) . its trace is given by tr[X] = Y^k^k- Let X G L(/f, H') and Y G L{H\ H). 
We then have 

tr[rX] = tr[XF], (244) 
where the left and right traces are on H and H' , respectively. In fact, 

tr[yX] = Y,^<Pu\Y\iJi){iJi\X\(t>k) = Y,{MX\(l>k){<Pk\Y\iJi) = tr[Xy], (245) 

kl kl 
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where {|0a;)} and are respectively orthonormal bases of H and H' . 

Let H and H' be Hilbert spaces with dimensions and N' , respectively. For any 
X G L{H,H'), there exist orthonormal bases {1^?^)}^;^ C H and {\ipk)}k=i H' 
such that 

N" 

X = J2^k\i^k){Vk\, (246) 

k=l 

where > (1 < A; < A^") and A^" := min{A^,A^'}. Equality f l246|l is called the 
singular value decomposition, and is called a singular value of X. 

We next consider the tensor product of two Hilbert spaces. Let Ha and Hb be 
two Hilbert spaces, and Ha ® Hb be their tensor product. For simplicity, we write 
\{Pa) ® I^Pb) G Ha®Hb as \({)a)\^b)- Let {Iv^'fc)} and {I'?/';)} be orthonormal bases of 
Ha and Hb, respectively. Any vector G Ha ® Hb can be written as 

\^) = Y,o^kiWMi), (247) 

kl 

where aki € C. By applying the singular- value decomposition to matrix (a^i), we 
find that there are orthonormal bases {Iv^'^)} C Ha and {IV';')} C Hb such that 

k 

where A^ > is a singular value of matrix (0;^;). Equality fl248p is called the Schmidt 
decomposition of 

B Proof of the Monotonicity of the Quantum Rel- 
ative Entropy 

In this appendix, we prove the monotonicity (1641) of the quantum relative entropy 
(Theorem 3.5) in line with Petz's proof |105j . We first prove some lemmas, in which 
the key concepts are the operator monotonicity and the operator convexity |155j . 

Let be a finite-dimensional Hilbert space and X G L{H) be a positive-definite 
operator with spectrum decomposition X = J2k-'^k\'^k){Vk\- We can substitute X to 
a function / : (0, 00) — )■ M as 

fiX):=J2fi^k)\^k){^k\. (249) 

k 

We then introduce the following concepts. 

Definition B.l / is called decreasing-operator monotone, if /(X) > f{Y) holds for 
any positive-definite operators X, y G L{H) satisfying X <Y . 

Definition B.2 / is called operator convex, if /(pX+(l—p)F) <pf{X) + {l—p)f(Y) 
holds for any < p < 1 and any positive-definite operators X, F G L{H). 
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In general, it is difficult to judge whether a function is operator monotone and is 
operator convex. However, it is not so difficult to show that f{x) = (x + (t > 0) 
and f{x) = —\nx are both decreasing-operator monotone and operator convex. 

Lemma B.l f{x) = (x + t)^^ {t > 0) is decreasing-operator monotone. 

Proof. It is obvious that X < I ^ X^^ > I with I e H the identity. We then 

have r-i/2xr-i/2 < / ^ yi/2^-iyi/2 > therefore X <Y ^ X'^ > Y-\ 

Since X<Y^X + tI<Y + tI,we obtain X <Y ^ {X + tl)-^ > (F + tI)-\ □ 

Lemma B.2 f{x) = (x + t)^^ (t > 0) is operator convex. 

Proof. Since is convex, {pX + (1 —p)I) < pX^^ + (1 —p)I holds. We then have 
+ < pY^/^X^^Y^/^ + {l-p)I, and therefore {pX+{l-p)Y) < 

pX-^ + {l-p)Y-\ By noting that {pX + {l-p)Y + tI) = {p{X + tI) + {l-p){Y+tI)), 
we obtain {pX + {1 - p)Y + tl)'^ < p{X + tiy^ + {1 - p){Y + tI)-\ □ 

Lemma B.3 f{x) := — Inx is decreasing-operator monotone and operator convex. 

Proof. It follows from that - Inx = {{x + t)-^ - (1 + t)-^) dt. □ 

It is known that any decreasing-operator monotone function is operator con- 
vex [156j . We next show an important property of operator convex functions. 

Lemma B.4 (Jensen's operator inequality |157HT^^j ) If / is operator convex, 

fiy^XV) < V^f{X)V (250) 

holds for any contraction V G L{H, H') and any positive-definite operator X G 
L{H') such that V^XV is also positive definiteH 



Proof. We define 



X':-- 
Vi : -- 
V2:-- 



X 




G L{H'®H,H'®H) 



V (/-l^\/t)l/2 

V -(/-Wt)l/2 

(/-\/tv^)i/2 



G L{H ®H',H' ®H), 
G L{H®H',H'®H), 



(251) 



where we used the assumption that V is a contraction to define (/ — V'^VY^'^ and 
(/ — VV'^Y^'^. By noting that H © H' ~ H' © H and by using the singular- value 
decomposition of V, it is easy to check that Vi and V2 are unitary. We have 



V^^X'Vi 



V^XV v^xu 
uxv uxu 



, V^X'V2 



V^XV -V^XU 

-uxv uxu 



(252) 



The definition of a contraction is given in Appendix A. 
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where U := (I - VV^y/^, and 

V^X'Vi + V^X'V2 



V^XV 

uxu 



(253) 



Therefore, by using the operator convexity of /, we obtain 



f{V^XV) 
f{UXU) 

VU\X')V^ + VU\X')V2 



^ I vlx'v. + v^x'v^ ^ ^ fiylx'v^) + fiv^x'v^i 



V^f{X)V 
Uf{X)U 



(254) 



which imphes inequahty fl250p . □ 



Conversely, it is known that [158J if inequahty fl250p is satisfied for any contraction 
V e L{H,H') and any positive-definite operator X G L{H'), then / is operator 
convex. 

We next consider a generahzation of the Schwarz inequahty. 



Lemma B.5 (Kadison inequality [160] ) Let X E H he a Hermitian operator 
and £ : L(H) — )• L[H') be a unital positive map|§ Then 



(255) 



Proof. Let X := Xl^i ^fc|v'fc)(v'fc| be the spectrum decomposition of X, where N 
is the dimension of H . We define X^ '■= S{\ipk){^Pk\), which satisfies X^ > due to 
the positivity of £ and J2k-^k = I due to assumption S{I) = I. Inequality (I255p can 
then be written as 



^ xlXk > I ^ XkXk j 

k \ k / 



or equivalently, for any \ip) G H', 

J2{^\xlX,\^p) > {z\z), 



(256) 



(257) 



where \z) := Ylk ^kXk\ip)- We introduce auxiliary system that has an orthonormal 
basis {\ek)}k=v define an inner product (■, ■)k in H' (g) such that 



^\^k)\ek),^\^k)\ek) := ^((^fclXfelV'fc). 



(258) 



^The definition of a unital map is given in Sec. 2.2.2. 



51 



By using the Schwarz inequality for the inner product (■, ■)k, we have 
{z\z) = '^{z\xkXk\ip) 



K 



\ k k / K \ k ^ / 



1/2 



K 



(259) 



1/2 



1/2 



\ k 



1/2 



\4Xk\ 



which imphes inequahty (125 7p . □ 

We note that we did not assume the complete positivity of £ for the Kadison 
inequality. The following lemma is a straightforward consequence: 

Lemma B.6 (Schwarz's operator inequality |161] ) Let X G L{H) be an arbi- 
trary operator and £ : L{H) — )■ L{H') be a unital 2-positive map. Then 



£{X^X) > £{X^)£{X). 



(260) 



Proof. By applying the Kadison inequality (I255p to positive map £ ® X2 and a 
Hermitian operator 



X' :-- 



xt 

X 



G L{H®C 



we obtain {£ ®T2){X'^) > ((^ O X2)(X')) , or equivalently 



£{X^X) 
£{XX^) 



> 



£{X^)£{X) 

£{X)£{X^) 



(261) 



(262) 



which implies inequality fl260p . □ 



We now prove the monotonicity flM|) in Theorem 3.5. Let £ : L{H) — t- L{H') be 
a CPTP map and p,(J & Q{H) be states. For simphcity, we assume that p, a, £{p), 
and £{o-) are positive definite. We define £, 71, and V that act on L{H) such that, 
for X G L{H), 

C{X) := aX, 7^(X) := Xp-\ ©(X) := (rXp~\ (263) 

where 15 = CTZ = IZC We note that £, 7?., and V are positive definite in terms of 
the Hilbert-Schmidt inner product 
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Let cr := '^j.PkWk){'~Pk\ be the spectrum decomposition. Then the eigenvectors of 
C are {\Vk){vi\}ki and the eigenvalues are {pk}k- We then have (ln£)(X) = {\n.a)X. 
Similarly, {\n.TZ){X) = — X(lnp). Therefore, 

(lnP)(X) = (ln£ + ln7^)(X) = (Ina)X - X(lnp). (264) 

By using the Hilbert-Schmidt inner product, we have 



S{p\W) = {p"\ (lnp)p^/^)HS - (//^ (lna)//2)Hs 
= (//^(-lnP)pV2)Hs. 



(265) 



On the other hand, 

S{£{p)\\S{a)) = {S{py/',{-\nV'mpy/'))ns, (266) 
where V acts on L{H') such that 

V'{X) := S{a)X£{p)-^ {X G L{H')). (267) 
We next define V : L{H') L{H) such that 

V(X) := £t (X£(p)-i/2) (268) 

or equivalently 

V (X^(p)'/') = ^^(X)p^/2 (269) 

for X E L(H'). We note that V{£{pY^'^) = p^^^ holds, because £ is trace-preserving 
so that £^ is unital. We then have 



Sip\\a) = {Vi£ipy/'),i-\nV)Vi£ipY/'))ns 
= {£{pY/',V\-\nV)V{£{pY/'))ns. 



(270) 



Therefore, our goal is to show that 

(f(p)V2,vt(-lnP)V(£:(p)^/2))HS > {£ip)'/M-\nV')i£ipY/'))ns. (271) 

To show inequality (12711) . it is enough to show that 

V^(-lnr')V > -InP'. (272) 

We then show that V is a contraction in terms of the Hilbert-Schmidt inner prod- 
uct. In fact, 

{£\X)p'/',£\X)p'/')ns = tr [p£\X^)£\X)] 
< tr [p£^{X^X)] = tr [£{p)X^X] (273) 

= {X£ipY/',X£{pY/')ns, 
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where we used the Schwarz inequahty fl26Up (Lemma B.6) for that is unital. Since 
— Inx is operator convex from Lemma B.3, we obtain that 

- \n{V^VV) < V^(- lnP)V, (274) 

where we used the Jensen's operator inequahty (12501) (Lemma B.4). 
We next show that 

V^VV < v. (275) 

In fact, 

= {V{XS{py/'),VV{XS{py/'))ns 

= {S\X)p'l\ Vi£\X)p'/'))us = tr [p£\X^)V£\X)] 

= tr [aS\X)S^{X^)] < tr [aS^{XX^)] = tr [S{a)XX^] ^ ' 

= tT[£{pY/'X^£ia)X£{pY/'£ip)-'] 

= {X£{pY/',V'{X£{pY/'))ns, 

where we again used the Schwarz inequahty (12601) for £''. Since — Inx is decreasing- 
operator monotone from Lemma B.3, we obtain 

-InP' < -ln(V"^PV). (277) 

By combining inequahties (I274p and (12771) . we finally obtain inequality (I272p . 
which implies the monotonicity fl64l) . We note that the assumption of the complete 
positivity has been used only for the proof of the Schwarz's operator inequality (I260p . 
in which the assumption of the 2-positivity is in fact enough. 
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